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181 GGACCCGGCGGCTTTCCGCGCGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGC 
CCTGGGCCGCCGAAAGGCGCGCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCG 



NFkB_CSl 

GGGRQTYYQC 

NFkB-MHC- I . 2 

TGGGCTTCCCC 
************************** 

241 ACGGCCGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCG 
TGCCGGCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGC 

Intronl 

**************************************************** ir + i Cicifi(i( x 

3 01 GCTGGGGTTGAGGGCGGCCGGGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTC 
CGACCCCAACTCCCGCCGGCCCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAG 

NFkB_CSl 

GGGRQTYYQC 

NFkB_CS2 

RGGGRMTYYCC 

Topo_II__cleavage__site 

RNYNNCNNGYNGKTNYNY 
***************** > 

3 61 AGGGCGCTTCCCCCGCAGGTGTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGG 
TCCCGCGAAGGGGGCGTCCACAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCC 



FIG. 12 
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1 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT 
51 AACCTCAGTA TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA 
101 ATGGAGGTTG ATGTTGATAA TCAAGCTGAT AATCATGGCA TTCACTCAGC 
151 TCTTAAGACT TGTGAAGAAA TTAAAGAAGC TAAAACGTTG TACTCTTGGA 
201 TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA TTATAAAGAT 
251 TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 
301 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT 
351 CAACTGGACT AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT 
401 TCATCAAGCG ATGTTTCAGA TAGACAAAAA CTTCAATGAT TTGGATTTCA 
451 ACTTAAGGGA AAT C AAT TAG CAAAGACCCA TTTATTAACA GCTCTTTCAA 
501 CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT TAGAGCAATG 
551 ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 
601 GCGAACTTCT GAAGGAA C T C TTGTTCAATT TTGCGGGAAT AACGTTTTTG 
5 m n^FH^ AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA 
701 GCAGACATGA ATGAACCTCG ATGTTGATCA ACCTGCAAAT ACAATGTCAA 
751 GAATGAGAAA GATCACTTTC TCAACAACAT CAACGTGCCG AATTGGAATA 
801 ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT TAATAGAAAT 
851 AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 
901 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA 
951 GAATTAGAAA GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG 
1001 CTTGAGAAAG TCAAAGATTT TAACTTCAAC TACTATTTAA CAAAATCTTG 
1051 TCCTCTTCCA GAAAATTGGC GGGAACGGAA ACAAAAAATC GAAAACTTGA 
1101 TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA GCTGTTTAGC 
1151 TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 
12 01 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC 

12 51 AAAAGAAAGT TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC 

13 01 AAAAACTTAT TGCTTGAGAA GATCAATACA AGAGAAATAT CATGGATGCA 

13 51 GGTTGAGACC TCTGCAAAGC ATTTTTATTA TTTTGATCAC GAAAACATCT 

14 01 ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT CGTCGTCTCG 

14 51 CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 

15 01 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA 
1551 TCGCAGACTT AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT 
1S01 GAAGAAT GG A AAAAGTCGCT TGGATTTGCA CCTGGAAAAC TCAGACTAAT 
1651 AC CGAAGAAA ACTACTTTCC GTCCAATTAT GACTTTCAAT AAGAAGATTG 
1701 TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC GAAGTTATTG 
1751 AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 
18 01 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG 

18 51 AGTTTGTTTG CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA 

19 01 ACTATGGATA TCGAAAAGTG ATATGATAGT GTAAACAGAG AAAAACTATC 
1951 AACATTCCTA AAAACTACTA AATTACTTTC TTCAGATTTC TGGATTATGA 
2 0 01 CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA TTCGAAAAAC 
2 051 TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 
2101 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG 
2151 AACAAAATGA CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAG CAA 
22 01 AGAAATTATT TTAAGAAAGA TAACTTACTT CAACCAGTCA TTAATATTTG 

22 51 CCAATATAAT TACATTAACT TTAATGGGAA GTTTTATAAA CAAACAAAAG 

23 01 GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC ATTTTATTAT 
23 51 GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 



FIG. 13 
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24 01 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGA C TTACA GATGACTATC 

2451 TTTTGATTAC AACTCAAGAG AATAATG C AG TATTGTTTAT TGAGAAACTT 

2501 ATAAACGTAA GTCGTGAAAA TGGATTTAAA TTCAATATGA AGAAACTACA 

2 551 GACTAGTTTT CCATTAAGTC CAAGCAAATT TGCAAAATAC GGAATGGATA 

2601 GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG GATTGGCATC 

2 651 TGAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACT TG AG AAT 

2 701 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATG CAAAC A AAGAAAGCAT 

2 751 CAATGTGGCT CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC 

2 8 01 CATTATTTTA GAAAGAC G AT TACAACCGAA GACTTTGCGA ATAAAACTCT 

2 851 CAACAAGTTA TTTATATCAG GCGGTTACAA AT AC ATG CAA TGAGCCAAAG 

2 901 AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG TATGATCGAC 

2 951 TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 

3 0 01 CCTTGTGTGC AATATTAAGG ATACAATTTT TGGAGAGGAG CAT T A T C C AG 
3 051 ACTTTTTCCT TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA 
3101 AAGTACATTT TCAACAGAGT TTGCATGATC CTCAAGGCAA AAG AAG C AAA 
3151 GCTAAAAAGT GACCAATGTC AATCTCTAAT TCAATATGAT GCATAGTCGA 
3 2 01 CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG TCTTATATAC 
3 2 51 TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 

FIG. 13 

(CONTINUED) 



1 MEVDVDNQAD NHGIHSALKT CEEIKEAKTL YSWIQKVIRC RNQSQSHYKD 

51 LEDIKIFAQT NIVATPRDYN EEDFKVIARK EVFSTGLMIE LIDKCLVELL 

101 SSSDVSDRQK LQCFGFQLKG NQLAKTHLLT ALSTQKQYFF QDEWNQVRAM 

151 IGNELFRHLY TKYLIFQRTS EGTLVQFCGN NVFDHLKVND KFDKKQKGGA 

2 01 ADMNEPRCCS TCKYNVKNEK DHFLNNINVP NWNNMKSRTR IFYCTHFNRN 
251 NQFFKKHEFV SNKNNI SAMD RAQTIFTNIF RFMRIRKKLK DKVIEKIAYM 

3 01 LEKVKDFNFN YYLTKSCPLP ENWRERKQKI ENLINKTREE KSKYYEELFS 
351 YTTDNKCVTQ FINEFFYNIL PKDFLTGRNR KNFQKKVKKY VELNKHELIH 
401 KNLLLEKINT REISWMQVET SAKHFYYFDH EN1YVLWKLL RWIFEDLVVS 
451 LIRCFFYVTE QQKSYSKTYY YRKNIWDVIM KMS I ADLKKE TLAEVQEKEV 
501 EEWKKSLGFA PGKLRLIPKK TTFRPIMTFN KKIVNSDRKT TKLTTNTKLL 
551 NSHLMLKTLK NRMFKDPFGF AVFNYDDVMK KYEE FVCKWK QVGQPKLFFA 
601 TMDIEKCYDS VNREKLSTFL KTTKLLSSDF WIMTAQILKR KNNIVIDSKN 
651 FRKKEMKDYF RQKFQKIALE GGQYPTLFSV LENEQNDLNA KKTL1VEAKQ 
701 RNYFKKDNLL QPVINICQYN YINFNGKFYK QTKGIPQGLC VSSILSSFYY 
751 ATLEESSLGF LRDESMNPEN PNVNLLMRLT DDYLLITTQE NNAVLFIEKL 
801 INVSRENGFK FNMKKLQTSF PLSPSKFAKY GMDSVEEQNI VQDYCDWIGI 
851 S I DMKTLALM PNINLRIEGI LCTLNLNMQT KKASMWLKKK LKSFLMNNIT 
901 HYFRKTITTE DFANKTLKFKL FISGGYKYMQ CAKEYKDHFK KNLAMS SMID 
951 LEVSKIIYSV TRAFFKYLVC NIKDTIFGEE HYPDFFLSTL KHFIEIFSTK 

1001 KYIFNRVCMI LKAKEAKLKS DQCQSLIQYD A 



F/G. 14 
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1 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 
3541 
3601 
3661 
3721 
3781 
3841 
3901 
3961 



gcagcgctgc 
gcgcgctccc 
gccgctggcc 
ggacccggcg 
acggccgccc 
ccgagtgctg 
gctgctggac 
cctgcccaac 
ccgcgtgggc 
ggctcccagc 
tcaggcccgg 
ctggaaccat 
gaggcgcggg 
tgcccctgag 
gcgtggaccg 
cacctctttg 
gcaccacgcg 
cccggtgtac 
gccctccttc 
gaccatcttt 
gccccagcgc 
gtgcccctac 
agccggtgtc 
cacagacccc 
cggcttcgtg 
caacgaacgc 
gctctcgctg 
gagcccaggg 
caagttcctg 
tgtcacggag 
caagttgcaa 
ggaagcagag 
cttcatcccc 
cagaacgttc 
cagcgtgctc 
cctggacgat 
gccgcctgag 
ggacaggctc 
tcggtatgcc 
cgtctctacc 
gaccagcccg 
cagtggcctc 
caagtcctac 
cagcctgtgc 
gctcctgcgt 
cttcctcagg 
gacagtggtg 
gccggcccac 
gcagagcgac 
cggcttcaag 
tcacagcctg 
caagatcctc 
tcagcaagtt 
ctgctactcc 
cggccctctg 
gactcgacac 
gctgagtcgg 
actgccctca 
gagcagacac 
cacacccagg 
catgtccggc 
gagtgtccag 
gggccagctt 
ccagattcgc 
aggtggagac 
ccctgtacac 
gaggtgctgt 



gtcctgctgc 
cgctgccgag 
acgttcgtgc 
gctttccgcg 
cccgccgccc 
cagaggctgt 
9gggcccgcg 
acggtgaccg 
gacgacgtgc 
tgcgcctacc 
cccccgccac 
agcgtcaggg 
ggcagtgcca 
ccggagcgga 
agtgaccgtg 
gagggtgcgc 
ggccccccat 
gccgagacca 
ctactcagct 
ctgggttcca 
tactggcaaa 

ggggtgctcc 

tgtgcccggg 
cgtcgcctgg 

c gggcctgcc 

cgcttcctca 
caggagctga 
gttggctgtg 
cactggctga 
accacgtttc 
agcattggaa 
gtcaggcagc 
aagcctgacg 
cgcagagaaa 
aactacgagc 
atccacaggg 
ctgtactttg 
acggaggtca 
gtggtccaga 
ttgacagacc 
ctgagggatg 
ttcgacgtct 
gtccagtgcc 
tacggcgaca 
ttggtggatg 
accctggtcc 
aacttccctg 
ggcctattcc 
tactccagct 
gctgggagga 
tttctggatt 
ctgctgcagg 
tggaagaacc 
atcctgaaag 
ccctccgagg 
cgtgtcacct 
aagctcccgg 
gacttcaaga 
cagcagccct 
cccgcaccgc 
tgaaggctga 
cacacctgcc 
ttcctcacca 
cattgttcac 
cctgagaagg 
aggcgaggac 
gggagtaaaa 



gcacgtggga 
ccgtgcgctc 
ggcgcctggg 
cgctggtggc 
cctccttccg 
gcgagcgcgg 

ggggcccccc 

acgcactgcg 
tggttcacct 
aggtgtgcgg 
acgctagtgg 
aggccggggt 
gccgaagtct 
cgcccgttgg 
gtttctgtgt 
tctctggcac 
ccacatcgcg 
agcacttcct 
ctctgaggcc 
ggccctggat 

tgcggcccct 

tcaagacgca 
agaagcccca 
tgcagctgct 
tgcgccggct 
ggaacaccaa 

cgtggaagat 

ttccggccgc 
tgagtgtgta 
aaaagaacag 
tcagacagca 
atcgggaagc 
ggctgcggcc 
agagggccga 
gggcgcggcg 
cctggcgcac 
tcaaggtgga 
tcgccagcat 
aggccgccca 
tccagccgta 
ccgtcgtcat 
tcctacgctt 

aggggatccc 

tggagaacaa 
atttcttgtt 
gaggtgtccc 
tagaagacga 
cctggtgcgg 
atgcccggac 
acatgcgtcg 
tgcaggtgaa 
cgtacaggtt 
ccacattttt 
ccaagaacgc 
ccgtgcagtg 
acgtgccact 
ggacgacgct 
ccatcctgga 
gtcacgccgg 
tgggagtctg 
gtgtccggct 
gtcttcactt 
ggagcccggc 
ccctcgccct 
accctgggag 
cctgcacctg 
tactgaatat 



agccctggcc 
cctgctgcgc 
gccccagggc 
ccagtgcctg 
ccaggtgtcc 
cgcgaagaac 
cgaggccttc 
ggggagcggg 
gctggcacgc 
gccgccgctg 
accccgaagg 
ccccctgggc 
gccgttgccc 
gcaggggtcc 

ggtgtcacct 
gcgccactcc 
gccaccacgt 
ctactcctca 
cagcctgact 
gccagggact 
gtttctggag 
ctgcccgctg 
gggctctgtg 
ccgccagcac 
ggtgccccca 
gaagttcatc 
gagcgtgcgg 
agagcaccgt 
cgtcgtcgag 
gctctttttc 
cttgaagagg 
caggcccgcc 
gattgtgaac 
gcgtctcacc 
ccccggcctc 
cttcgtgctg 
tgtgacgggc 
catcaaaccc 

tgggcacgtc 

catgcgacag 
cgagcagagc 
catgtgccac 
gcagggctcc 
gctgtttgcg 
ggtgacacct 
tgagtatggc 
ggccctgggt 
cctgctgctg 
ctccatcaga 
caaactcttt 
cagcctccag 
tcacgcatgt 
cctgcgcgtc 
agggatgtcg 
gctgtgccac 
cctggggtca 
gactgccctg 
ctgatggcca 
gctctacgtc 
aggcctgagt 
gaggcctgag 
ccccacaggc 
ttccactccc 
gccctccttt 
ctctgggaat 
gatgggggtc 
atgagttttt 



ccggccaccc 
agccactacc 
tggcggctgg 
gtgtgcgtgc 
tgcctgaagg 
gtgctggcct 
accaccagcg 
gcgtgggggc 
tgcgcgctct 
taccagctcg 
cgtctgggat 
ctgccagccc 
aagaggccca 

tgggcccacc 

gccagacccg 
cacccatccg 
ccctgggaca 
ggcgacaagg 
ggcgctcgga 
ccccgcaggt 
ctgcttggga 
cgagctgcgg 
gcggcccccg 
agcagcccct 
ggcctctggg 
tccctgggga 
gactgcgctt 
ctgcgtgagg 
ctgctcaggt 
taccggaaga 
gtgcagctgc 
ctgctgacgt 
atggactacg 
tcgagggtga 
ctgggcgcct 
cgtgtgcggg 
gcgtacgaca 
cagaacacgt 
cgcaaggcct 
ttcgtggctc 
tcctccctga 
cacgccgtgc 
atcctctcca 
gggattcggc 
cacctcaccc 
tgcgtggtga 
ggcacggctt 
gatacccgga 
gccagtctca 
ggggtcttgc 
acggtgtgca 
gtgctgcagc 
atctctgaca 

ct gggggcca 

caagcattcc 
ctcaggacag 
gaggccgcag 
cccgcccaca 
ccagggaggg 
gagtgtttgg 
cgagtgtcca 
tggcgctcgg 
cacataggaa 
gccttccacc 
ttggagtgac 
cctgtgggtc 
cagttttgaa 



ccgcgatgcc 
gcgaggtgct 
tgcagcgcgg 
cctgggacgc 
agctggtggc 
tcggcttcgc 
tgcgcagcta 
tgctgctgcg 
ttgtgctggt 
gcgctgccac 
gcgaacgggc 
C9ggtgcgag 
ggcgtggcgc 
cgggcaggac 
ccgaagaagc 
tgggccgcca 
cgccttgtcc 
agcagctgcg 
ggctcgtgga 
tgccccgcct 
accacgcgca 
tcaccccagc 
aggaggagga 
ggcaggtgta 
gctccaggca 
agcatgccaa 
ggctgcgcag 
agatGCtggc 
ctttctttta 
gtgtctggag 
gggagctgtc 
ccagactccg 
tcgtgggagc 
aggcactgtt 
ctgtgctggg 
CGcaggaccc 
ccatccccca 
actgcgtgcg 
tcaagagcca 
acctgcagga 
atgaggccag 
gcatcagggg 
cgctgctctg 
gggacgggct 
acgcgaaaac 
acttgcggaa 
ttgttcagat 
ccctggaggt 
ccttcaaccg 
ggctgaagtg 
ccaacatcta 
tcccatttca 
cggcctccct 
agggcgccgc 
tgctcaagct 
cccagacgca 
ccaacccggc 
gccaggccga 
aggggcggcc 
ccgaggcctg 
gccaagggct 
ctccacccca 
tagtccatcc 
cccaccat cc 
caaaggtgtg 
aaattggggg 
aaaaa 
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MPRAPRCRAVRSLLRSHYREVLPLATFVRRLGPQGWRLVQRGDP 
AAFRALVAQCLVCVPWDARP P PAAPS FRQVS CLKEL VARVLQRL 
CERGAKNVLAFGFALLDGARGGPPEAFTTSVRSYLPNTVTDALR 
GSGAWGLLLRRVGDDVLVHLLARCALFVLVAPSCAYQVCGPPLY 
QLGAATQARPPPHASGPRRRLGCERAWNHSVREAGVPLGLPAPG 
ARRRGGSASRSLPLPKRPRRGAAPEPERTPVGQGSWAHPGRTRG 
P S DRGFCWS PAR P AE E AT S L E GAL S GTRH S H P S VGRQHH AG P P 
STSRPPRPWDTPCPPVYAETKHFLYSSGDKEQLRPSFLLSSLRP 
SLTGARRLVETIFLGSRPWMPGTPRRLPRLPQRYWQMRPLFLEL 
LGNHAQCPYGVLLKTHCPLRAAVT PAAGVCAREKPQGSVAAPEE 
EDTDPRRLVQLLRQHS S PWQ V YGFVRACLRRLVP PGLWGSRHNE 
RRFLRNTKKFISLGKHAKLSLQELTWKMSVRDCAWLRRSPGVGC 
VPAAEHRLREEILAKFLHWLMSVYWELLRSFFYVTETTFQKNR 
LFFYRKSVWSKLQS IGIRQHLKRVQLRELSEAEVRQHREARPAL 
LTSRLRFIPKPDGLRPIVNMDYWGARTFRREKRAERLTSRVKA 
LFSVLNYERARRPGLLGASVLGLDDIHRAWRTFVLRVRAQDPPP 
EL YF VKVDVTGAYDT I PQDRLTE VI AS I I KPQNTYCVRRYAVVQ 
KAAHGHVRKAFKSHVSTLTDLQPYMRQFVAHLQETSPLRDAVVI 
EQSSSLNEASSGLFDVFLRFMCHHAVRIRGKSYVQCQGIPQGSI 
LSTLLCSLCYGDMENKLFAGIRRDGLLLRLVDDFLLVTPHLTHA 
KTFLRTLVRGVPEYGCVVNLRKTVVNFPVEDEALGGTAFVQMPA 
HGLFPWCGLLLDTRTLEVQSDYSSYARTSIRASLTFNRGFKAGR 
NMRRKLFGVLRLKCHSLFLDLQVNSLQTVCTNIYKILLLQAYRF 
HACVLQLPFHQQVWKNPTFFLRVI SDTASLCYS I LKAKNAGMSL 
GAKGAAG PL P S E AVQWLCHQAFLLKLTRHRVT YVPLLGS LRTAQ 
TQLSRKLPGTTLTALEAAANPALPSDFKTILD 

FIG. 17 

GGCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTT 
TTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTG 
GAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCT 
GTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACT 
CCGCTTCATCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGG 
AGCCAGAACGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACT 
GTTCAGCGTGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCT 
GGGCCTGGACGATATCCACAGGGCCTGGCGCACCTTCGTGGTGCGTGTGCGGGCCCAGGA 
CCCGCCGCCTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCC 
CCAGGACAGGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGT 
GCGTCGGTATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAG 
CCACGTCCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCT 
GCAGCCTGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGC 
TGCTCCTGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAA 
CCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGA 
AGACAGTGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGA 
TGCCGGCCGACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGG 
TGCAGAGCGACTACTCCAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACC 
GCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGT 
GTCACAGCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCACCAACATCT 
ACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTC 
ATCAGCAAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCC 
TCTGCTACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCG 
CCGGCC7TCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGC 
TGACTCGACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGC 
AGCTGAGTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGG 
CACTGCCCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCG 
AGAGCAGACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGC 
CCACACCCAGGCCTGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCT 
GCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGC 
TGAGTGTCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCC 
AGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATC 
CCCAGATTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATC 
CAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGT 
GCCCTGTACACAGGCGAGGACGCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGG 
GGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGOAAAAAAAAAA 
AAAAAAAAAAAAAAAA 

FIG. 18 
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MetSerValTyrValValGluLeuLeuArgSerPhePhe 
TyrValThrGluThrThrPheGlnLysAsnArgLeuPhe 
PheTyrArgLysSerValTrpSerLysLeuGlnSerlle 
GlylleArgGlnHisLeuLysArgValGlnLeuArgGlu 
LeuSerGluAlaGluValArgGlnHisArgGluAlaArg 
ProAlaLeuLeuThrSerArgLeuArgPhelleProLys 
ProAspGlyLeuArgProIleValAsnMetAspTyrVal 
ValGlyAlaArgThrPheArgArgGluLysArgAlaGlu 
ArgLeuThrSerArgValLysAlaLeuPheSerValLeu 
AsnTyrGluArgAlaArgArgProGlyLeuLeuGlyAla 
SerValLeuGlyLeuAspAspIleHisArgAlaTrpArg 
ThrPheValLeuArgValArgAlaGlnAspProProPro 
GluLeuTyrPheValLysValAspValThrGlyAlaTyr 
AspThrlleProGlnAspArgLeuThrGluVallleAla 
SerllelleLysProGlnAsnThrTyrCysValArgArg 
TyrAlaValValGlnLysAlaAlaHisGlyHisValArg 
LysAlaPheLysSerHisValLeuArgProValProGly 
AspProAlaGlyLeuHisProLeuHisAlaAlaLeuGln 
ProValLeuArgArgHisGlyGluGlnAlaValCysGly 
AspSerAlaGlyArgAlaAlaProAlaPheGlyGly 

FIG. 19 



met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 



70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 



100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

HO 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 



FIG. 20 
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130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 



140 



150 



trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 



170 



180 



J_ o u 

ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arq 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 

200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 



230 



240 



Z,1 u 

ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 



280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 
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leu tyr ser 
CTC TAC TCC 



leu ser ser 
CTC AGC TCT 



glu thr ile 
GAG ACC ATC 



arg arg leu 
CGC AGG TTG 



leu phe leu 
CTG TTT CTG 



val leu leu 
GTG CTC CTC 



ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ser gly asp 
TCA GGC GAC 

350 

leu arg pro 
CTG AGG CCC 



phe leu gly 
TTT CTG GGT 

380 

pro arg leu 
CCC CGC CTG 



glu leu leu 
GAG CTG CTT 

410 

lys thr his 
AAG ACG CAC 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



340 
lys glu gin 
AAG GAG CAG 



ser leu thr 
AGC CTG ACT 

370 

ser arg pro 
TCC AGG CCC 



pro gin arg 
CCC CAG CGC 

400 

gly asn his 
GGG AAC CAC 



cys pro leu 
TGC CCG CTG 

430 

ar g gi u iy s 

CGG GAG AAG 



thr asp pro 
ACA GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 



leu arg pro 
CTG CGG CCC 



gly ala arg 
GGC GCT CGG 



trp met pro 
TGG ATG CCA 



tyr trp gin 
TAC TGG CAA 



ala gin cys 
GCG CAG TGC 



arg ala ala 
CGA GCT GCG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



ser phe leu 
TCC TTC CTA 

360 
arg leu val 
AGG CTC GTG 



gly thr pro 
GGG ACT CCC 

390 

met arg pro 
ATG CGG CCC 



pro tyr gly 
CCC TAC GGG 

420 
val thr pro 
GTC ACC CCA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 
thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 
glu ile leu 
GAG ATC CTG 
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ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 

ile gly ile 
ATT GGA ATC 



ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



&rg glu lys 
AGA GAA AAG 



phe ser val 
TTC AGC GTG 



gly ala ser 
GGC GCC TCT 



thr phe val 
ACC TTC GTG 



tyr phe val 
TAC TTT GTC 



gin asp arg 
CAG GAC AGG 



asn thr tyr 
AAC ACG TAC 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 

TTC TAC CGG 
590 

arg gin his 

AGA CAG CAC 



glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 

650 

arg ala glu 
AGG GCC GAG 



leu asn tyr 
CTC AAC TAC 

680 

val leu gly 
GTG CTG GGC 



leu arg val 
CTG CGT GTG 

710 

lys val asp 
AAG GTG GAT 



leu thr glu 
CTC ACG GAG 

740 

cys val arg 
TGC GTG CGT 



550 
leu met ser 
CTG ATG AGT 



val thr glu 
GTC ACG GAG 

580 

lys ser val 
AAG AGT GTC 

leu lys arg 
TTG AAG AGG 

610 

gin his arg 
CAG CAT CGG 



phe ile pro 
TTC ATC CCC 

640 

tyr val val 
TAC GTC GTG 



arg leu thr 
CGT CTC ACC 

670 

glu arg ala 
GAG CGG GCG 



leu asp asp 
CTG GAC GAT 

700 

arg ala gin 
CGG GCC CAG 



val thr gly 
GTG ACG GGC 

730 

val ile ala 
GTC ATC GCC 



arg tyr ala 
CGG TAT GCC 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 

val gin leu 
GTG CAG CTG 



glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser arg val 
TCG AGG GTG 



arg arg pro 
CGG CGC CCC 



ile his arg 
ATC CAC AGG 



asp pro pro 
GAC CCG CCG 



ala tyr asp 
GCG TAC GAC 



ser ile ile 
AGC ATC ATC 



val val gin 
GTG GTC CAG 



val glu leu 
GTC GAG CTG 

570 
gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 
arg glu leu 
CGG GAG CTG 



pro ala leu 
CCC GCC CTG 

630 
gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 

660 
lys ala leu 
AAG GCA CTG 



gly leu leu 
GGC CTC CTG 

690 
ala trp arg 
GCC TGG CGC 



pro glu leu 
CCT GAG CTG 

720 

thr ile pro 
ACC ATC CCC 



lys pro gin 
AAA CCC CAG 

750 

lys ala ala 
AAG GCC GCC 
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760 



his gly his val arg lys ala phe lys ser his val leu arg pro 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC CTA CGT CCA 

770 780 
val pro gly asp pro ala gly leu his pro leu his ala ala leu 
GTG CCA GGG GAT CCC GCA GGG CTC CAT CCT CTC CAC GCT GCT CTG 

790 

gin pro val leu arg arg his gly glu gin ala val cys qly asp 
CAG CCT GTG CTA CGG CGA CAT GGA GAA CAA GCT GTT TGC GGG GAT 

800 807 
ser ala gly arg ala ala pro ala phe gly gly OP 

TCG GCG GGA CGG GCT GCT CCT GCG TTT GGT GGA TGA TTTCTTGTTGGT 

GACACCTCACCTCACCCACGCGAAAACCTTCCTCAGGACCCTGGTCCGAGGTGTCCCTGA 

GTATGGCTGCGTGGTGAACTTGCGGAAGACAGTGGTGAACTTCCCTGTAGAAGACGAGGC 

CCTGGGTGGCACGGCTTTTGTTCAGATGCCGGCCCACGGCCTATTCCCCTGGTGCGGCCT 

GCTGCTGGATACCCGGACCCTGGAGGTGCAGAGCGACTACTCCAGCTATGCCCGGACCTC 

CATCAGAGCCAGTCTCACCTTCAACCGCGGCTTCAAGGCTGGGAGGAACATGCGTCGCAA 

ACTCTTTGGGGTCTTGCGGCTGAAGTGTCACAGCCTGTTTCTGGATTTGCAGGTGAACAG 

CCTCCAGACGGTGTGCACCAACATCTACAAGATCCTCCTGCTGCAGGCGTACAGGTTTCA 

CGCATGTGTGCTGCAGCTCCCATTTCATCAGCAAGTTTGGAAGAACCCCACATTTTTCCT 

GCGCGTCATCTCTGACACGGCCTCCCTCTGCTACTCCATCCTGAAAGCCAAGAACGCAGG 

GATGTCGCTGGGGGCCAAGGGCGCCGCCGGCCCTCTGCCCTCCGAGGCCGTGCAGTGGCT 

GTGCCACCAAGCATTCCTGCTCAAGCTGACTCGACACCGTGTCACCTACGTGCCACTCCT 

GGGGTCACTCAGGACAGCCCAGACGCAGCTGAGTCGGAAGCTCCCGGGGACGACGCTGAC 

TGCCCTGGAGGCCGCAGCCAACCCGGCACTGCCCTCAGACTTCAAGACCATCCTGGACTG 

ATGGCCACCCGCCCACAGCCAGGCCGAGAGCAGACACCAGCAGCCCTGTCACGCCGGGCT 

CTACGTCCCAGGGAGGGAGGGGCGGCCCACACCCAGGCCCGCACCGCTGGGAGTCTGAGG 

CCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAG 

GCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCCGTCTTCACTTCCC 

CACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTC 

CACTCCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCCTCGCCCTGCC 

CTCCTTTGCCTTCCACCCCCACCATCCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTC 

TGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGAT 

GGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATG 

AGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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1 CCATGGGACCCACTGCAGGGGCAGCTGGGAGGCTGCAGGCTTCAGGTCCCAGTGGGGTTG 
GGTACCCTGGGTGACGTCCCCGTCGACCCTCCGACGTCCGAAGTCCAGGGTCACCCCAAC 

6 1 CCATCTGCCAGTAGAAACCTGATGTAGAATCAGGGCGCGAGTGTGGACACTGTCCTGAAT 
GGTAGACGGTCATCTTTGGACTACATCTTAGTCCCGCGCTCACACCTGTGACAGGACTTA 

121 CTCAATGTCTCAGTGTGTGCTGAAACATGTAGAAATTAAAGTCCATCCCTCCTACTCTAC 
GAGTTACAGAGTCACACACGACTTTGTACATCTTTAATTTCAGGTAGGGAGGATGAGATG 

181 TGGGATTGAGCCCCTTCCCTATCCCCCCCCAGGGGCAGAGGAGTTCCTCTCACTCCTGTG 
ACCCTAACTCGGGGAAGGGATAGGGGGGGGTCCCCGTCTCCTCAAGGAGAGTGAGGACAC 

241 GAGGAAGGAATGATACTTTGTTATTTTTCACTGCTGGTACTGAATCCACTGTTTCATTTG 
CTCCTTCCTTACTATGAAACAATAAAAAGTGACGACCATGACTTAGGTGACAAAGTAAAC 



3 01 TTGGTTTGTTTGTTTTGTTTTGAGAGGCGGTTTCACTCTTGTTGCTCAGGCTGGAGGGAG 
AACCAAACAAACAAAACAAAACTCTCCGCCAAAGTGAGAACAACGAGTCCGACCTCCCTC 



3 61 TGCAATGGCGCGATCTTGGCTTACTGCAGCCTCTGCCTCCCAGGTTCAAGTGATTCTCCT 
AGGTTACCGCGCTAGAACCGAATGACGTCGGAGACGGAGGGTCCAAGTTCACTAAGAGGA 

alu 

421 GCTTCCGCCTCCCATTTGGCTGGGATTACAGGCACCCGCCACCATGCCCAGCTAATTTTT 
CGAAGGCGGAGGGTAAACCGACCCTAATGTCCGTGGGCGGTGGTACGGGTCGATTAAAAA 



4 81 TGTATTTTTAGTAGAGACGGGGGTGGGGGTGGGGTTCACCATGTTGGCCAGGCTGGTCTC 
ACATAAAAATCATCTCTGCCCCCACCCCCACCCCAAGTGGTACAACCGGTCCGACCAGAG 

CAP 



******************************* 

541 GAACTTCTGACCTCAGATGATCCACCTGCCTCTGCCTCCTAAAGTGCTGGGATTACAGGT 
CTTGAAGACTGGAGTCTACTAGGTGGACGGAGACGGAGGATTTCACGACCCTAATGTCCA 



********************* 

601 GTGAGCCACCATGCCCAGCTCAGAATTTACTCTGTTTAGAAACATCTGGGTCTGAGGTAG 
CACTCGGTGGTACGGGTCGAGTCTTAAATGAGACAAATCTTTGTAGACCCAGACTCCATC 

CCAAT 
***************> 

661 GAAGCTCACCCCACTCAAGTGTTGTGGTGTTTTAAGCCAATGATAGAATTTTTTTATTGT 
CTTCGAGTGGGGTGAGTTCACAACACCACAAAATTCGGTTACTATCTTAAAAAAATAACA 

7 21 TGTTAGAACACTCTTGATGTTTTACACTGTGATGACTAAGACATCATCAGCTTTTCAAAG 
ACAATCTTGTGAGAACTACAAAATGTGACACTACTGATTCTGTAGTAGTCGAAAAGTTTC 
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CAP 

***************> 

781 ACACACTAACTGCACCCATAATACTGGGGTGTCTTCTGGGTATCAGCGATCTTCATTGAA 
TGTGTGATTGA CGTGGGTATTATGAC C C CACAGAAGAC C CATAGT CGCTAGAAGTAACTT 

CAP 

************ 

841 TGCCGGGAGGCGTTTCCTCGCCATGCACATGGTGTTAATTACTCCAGCATAATCTTCTGC 
ACGGCCCTCCGCAAAGGAGCGGTACGTGTACCACAATTAATGAGGTCGTATTAGAAGACG 



* * * > 

901 TTCCATTTCTTCTCTTCCCTCTTTTAAAATTGTGTTTTCTATGTTGGCTTCTCTGCAGAG 
AAGGTAAAGAAGAGT^AGGGAGAAAATTTTAACACAAAAGATACAACCGAAGAGACGTCTC 

CAP 

961 AACCAGTGTAAGCTACAACTTAACTTTTGTTGGAACAAATTTTCCAAACCGCCCCTTTGC 
TTGGTCACATTCGATGTTGAATTGAAAACAACCTTGTTTAAAAGGTTTGGCGGGGAAACG 

1021 CCTAGTGGCAGAGACAATTCACAAACACAGCCCTTTAAAAAGGCTTAGGGATCACTAAGG 
GGATCACCGTCTCTGTTAAGTGTTTGTGTCGGGAAATTTTTCCGAATCCCTAGTGATTCC 

1081 GGATTTCTAGAAGAGCGACCCGTAATCCTTAAGTATTTACAAGACGAGGCTAACCTCCAG 
CCTAAAGATCTTCTCGCTGGGCATTAGGAATTCATAAATGTTCTGCTCCGATTGGAGGTC 

1141 CGAGCGTGACAGCCCAGGGAGGGTGCGAGGCCTGTTCAAATGCTAAGCTTCCATAAATAA 
GCTCGCACTGTCGGGTCCCTCCCACGCTCCGGACAAGTTTACGATTCGAAGGTATTTATT 

1201 AGCAAATTTCCTCCGGCAGTTTCTGGAAAGTAGG7VAAGGTTAACATTTAAGGTTGCGTTT 
TCGTTTAAAGGAGGCCGTCAAAGACCTTTCATCCTTTCCAATTGTAAATTCCAACGCAAA 

1261 GTTAGCATTTCAGTGTTTGCCGACCTCAGCTAACAGCATCCCTGCAAGGCCTCGGGAGAC 
CAATCGTAAAGTCACAAACGGCTGGAGTCGATTGTCGTAGGGACGTTCCGGAGCCCTCTG 

1321 CCAGAAGTTTCTCGCCCCTTAGATCCAAACTTGAGCAACCCGGAGTCTGGATTCCTGGGA 
GGTCTTCAAAGAGCGGGGAATCTAGGTTTGAACTCGTTGGGCCTCAGACCTAAGGACCCT 

TopoII 

1381 AGTCCTCAGCTGTCCTGCGGTTGTGCCGGGGCCCCAGGTCTGGAGGGGACCAGTGGCCGT 
TCAGGAGTCGACAGGACGCCAACACGGCCCCGGGGTCCAGACCTCCCCTGGTCACCGGCA 

1441 GTGGCTTCTACTGCTGGGCTGGAAGTCGGGCCTCCTAGCTCTGCAGTCCGAGGCTTGGAG 
CACCGAAGATGACGACCCGACCTTCAGCCCGGAGGATCGAGACGTCAGGCTCCGAACCTC 



1501 CCAGGTGCCTGGACCCCGAGGCTGCCCTCCACCCTGTGCGGGCGGGATGTGACCAGATGT 
GGTCCACGGACCTGGGGCTCCGACGGGAGGTGGGACACGCCCGCCCTACACTGGTCTACA 

1561 TGGCCTCATCTGCCAGACAGAGTGCCGGGGCCCAGGGTCAAGGCCGTTGTGGCTGGTGTG 
ACCGGAGTAGACGGTCTGTCTCACGGCCCCGGGTCCCAGTTCCGGCAACACCGACCACAC 

1621 AGGCGCCCGGTGCGCGGCCAGCAGGAGCGCCTGGCTCCATTTCCCACCCTTTCTCGACGG 
TCCGCGGGCCACGCGCCGGTCGTCCTCGCGGACCGAGGTAAAGGGTGGGAAAGAGCTGCC 
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1681 GACCGCCCCGGTGGGTGATTAACAGATATTGGGGTGGTTTGCTCATGGTGGGGACCCCTT 
CTGGCGGGGCCACCCACTAATTGTCTATAACCCCACCAAACGAGTACCACCCCTGGGGAA 

1741 CGCCGCCTGAGAACCTGCAAAGAGAAATGACGGGCCTGTGTCAAGGAGCCCAAGTCGCGG 
GCGGCGGACTCTTGGACGTTTCTCTTTACTGCCCGGACACAGTTCCTCGGGTTCAGCGCC 

18 01 GGAAGTGTTGCAGGGAGGCACTCCGGGAGGTCCCGCGTGCCCGTCCAGGGAGCAATGCGT 
CCTTCACAACGTCCCTCCGTGAGGCCCTCCAGGGCGCACGGGCAGGTCCCTCGTTACGCA 

18 61 CCTCGGGTTCGTCCCCAGCCGCGTCTACGCGCCTCCGTCCTCCCCTTCACGTCCGGCATT 
GGAGCCCAAGCAGGGGTCGGCGCAGATGCGCGGAGGCAGGAGGGGAAGTGCAGGCCGTAA 

1921 CGTGGTGCCCGGAGCCCGACGCCCCGCGTCCGGACCTGGAGGCAGCCCTGGGTCTCCGGA 
GCACCACGGGCCTCGGGCTGCGGGGCGCAGGCCTGGACCTCCGTCGGGACCCAGAGGCCT 

1981 TCAGGCCAGCGGCCAAAGGGTCGCCGCACGCACCTGTTCCCAGGGCCTCCACATCATGGC 
AGTCCGGTCGCCGGTTTCCCAGCGGCGTGCGTGGACAAGGGTCCCGGAGGTGTAGTACCG 

2 041 CCCTCCCTCGGGTTACCCCACAGCCTAGGCCGATTCGACCTCTCTCCGCTGGGGCCCTCG 
GGGAGGGAGCCCAATGGGGTGTCGGATCCGGCTAAGCTGGAGAGAGGCGACCCCGGGAGC 

Spl 
******** 

2101 CTGGCGTCCCTGCACCCTGGGAGCGCGAGCGGCGCGCGGGCGGGGAAGCGCGGCCCAGAC 
GACCGCAGGGACGTGGGACCCTCGCGCTCGCCGCGCGCCCGCCCCTTCGCGCCGGGTCTG 

2161 CCCCGGGTCCGCCCGGAGCAGCTGCGCTGTCGGGGCCAGGCCGGGCTCCCAGTGGATTCG 
GGGGCCCAGGCGGGCCTCGTCGACGCGACAGCCCCGGTCCGGCCCGAGGGTCACCTAAGC 

2221 CGGGCAACAGACGCCCAGGACCGCGCTTCCCACGTGGCGGAGGGACTGGGGACCCGGGCA 
GCCCGTTGTCTGCGGGTCCTGGCGCGAAGGGTGCACCGCCTCCCTGACCCCTGGGGCCGT 

Spl 



E2F 
******** 

22 81 CCGGTCCTGCCCCTTCACCTTCCAGCTCCGCCTCGTCCGCGCGGAACCCCGCCCCGTCCC 
GGCCAGGACGGGGAAGTGGAAGGTCGAGGCGGAGCAGGCGCGCCTTGGGGCGGGGCAGGG 

2 341 GAACCCTTCCCGGGTCCCCGGCCCAGCCCCTTCCGGGCCATCCCAGCCCGTCCCGTTCCT 
CTTGGGAAGGGCCCAGGGGCCGGGTCGGGGAAGGCCCGGTAGGGTCGGGCAGGGCAAGGA 

Spl 



E2F NFkB 
********* ****** * * * * * * * * * * * * * * * * * * * * * * * 

24 01 TTTCCGCGGCCCCGCCCTCTCCTCGCGGCGCGAGTTTCAGGCAGCGCTGCGTCCTGCTGC 
AAAGGCGCCGGGGCGGGAGAGGAGCGCCGCGCTCAAAGTCCGTCGCGACGCAGGACGACG 



hTRT5 ' 

***************************** 



> 



24 61 GCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCCGCGCGCTCCCCGCTGCCGAG 
CGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGGCGCGCGAGGGGCGACGGCTC 

2521 CCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCTGCCGCTGGCCACGTTCGTGC 
GGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGACGGCGACCGGTGCAAGCACG 
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E2F 
******* 

2581 GGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGGGGACCCGGCGGCTTTCCGCG 
CCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCCCCTGGGCCGCCGAAAGGCGC 



* 

2641 CGCTGGTGGCCCAGTGCCTGGTGTGCGTGCCCTGGGACGCACGGCCGCCCCCCGCCGCCC 
GCGACCACCGGGTCACGGACCACACGCACGGGACCCTGCGTGCCGGCGGGGGGCGGCGGG 

NFkB 



********************************************** 
27 01 CCTCCTTCCGCCAGGTGGGCCTCCCCGGGGTCGGCGTCCGGCTGGGGTTGAGGGCGGCCG 
GGAGGAAGGCGGTCCACCCGGAGGGGCCCCAGCCGCAGGCCGACCCCAACTCCCGCCGGC 

Topo_I I_c 1 eavag 



NFkB 
++++++++++ 
NFkB 



Intronl 

*********************************************************> 

2 761 GGGGGAACCAGCGACATGCGGAGAGCAGCGCAGGCGACTCAGGGCGCTTGCCCCGCAGGT 
CCCCCTTGGTCGCTGTACGCCTCTCGTCGCGTCCGCTGAGTCCCGCGAAGGGGGCGTCCA 

e site 



2 821 GTCCTGCCTGAAGGAGCTGGTGGCCCGAGTGCTGCAGAGGCTGTGCGAGCGCGGCGCGAA 
CAGGACGGACTTCCTCGACCACCGGGCTCACGACGTCTCCGACACGCTCGCGCCGCGCTT 

2881 GAACGTGCTGGCCTTCGGCTTCGCGCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGC 
CTTGCACGACCGGAAGCCGAAGCGCGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCG 



2 941 CTTCACCACCAGCGTGCGCAGCTACCTGCCCAACACGGTGACCGACGCACTGCGGGGGAG 

GAAGTGGTGGTCGCACGCGTCGATGGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTC 

3 001 CGGGGCGTGGGGGCTGCTGCTGCGCCGCGTGGGCGACGACGTGCTGGTTCACCTGCTGGC 

GCCCCGCACCCCCGACGACGACGCGGCGCACCCGCTGCTGCACGACCAAGTGGACGACCG 

3 061 ACGCTGCGCGCTCTTTGTGCTGGTGGCTCCCAGCTGCGCCTACCAGGTGTGCGGGCCGCC 
TGCGACGCGCGAGAAACACGACCACCGAGGGTCGACGCGGATGGTCCACACGCCCGGCGG 

3121 GCTGTACCAGCTCGGCGCTGCCACTCAGGCCCGGCCCCCGCCACACGCTAGTGGACCCCG 
CGACATGGTCGAGCCGCGACGGTGAGTCCGGGCCGGGGGCGGTGTGCGATCACCTGGGGC 

3181 AAGGCGTCTGGGATGCGAACGGGCCTGGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCT 
TTCCGCAGACCCTACGCTTGCCCGGACCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGA 

3241 GGGCCTGCCAGCCCCGGGTGCGAGGAGGCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTT 
CCCGGACGGTCGGGGCCCACGCTCCTCCGCGCCCCCGTCAGGGTCGGCTTCAGACGGCAA 

33 01 GCCCAAGAGGCCCAGGCGTGGCGCTGCCCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGG 
CGGGTTCTCCGGGTCCGCACCGCGACGGGGACTCGGCCTCGCCTGCGGGCAACCCGTCCC 
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33 61 GTCCTGGGCCCACCCGGGCAGGACGCGTGGACCGAGTGACCGTGGTTTCTGTGTGGTGTC 
CAGGACCCGGGTGGGCCCGTCCTGCGCACCTGGCTCACTGGCACCAAAGACACACCACAG 

3421 ACCTGCCAGACCCGCCGAAGAAGCCACCTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCA 
TGGACGGTCTGGGCGGCTTCTTCGGTGGAGAAACCTCCCACGCGAGAGACCGTGCGCGGT 

3481 CTCCCACCCATCCGTGGGCCGCCAGCACCACGCGGGCCCCCCATCCACATCGCGGCCACC 
GAGGGTGGGTAGGCACCCGGCGGTCGTGGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGG 

3 541 ACGTCCCTGGGACACGCCTTGTCCCCCGGTGTACGCCGAGACCAAGCACTTCCTCTACTC 
TGCAGGGACCCTGTGCGGAACAGGGGGCCACATGCGGCTCTGGTTCGTGAAGGAGATGAG 

3 6 01 CTCAGGCGACAAGGAGCAGCTGCGGCCCTCCTTCCTACTCAGCTCTCTGAGGCCCAGCCT 
GAGTCCGCTGTTCCTCGTCGACGCCGGGAGGAAGGATGAGTCGAGAGACTCCGGGTCGGA 

3 661 GACTGGCGCTCGGAGGCTCGTGGAGACCATCTTTCTGGGTTCCAGGCCCTGGATGCCAGG 
CTGACCGCGAGCCTCCGAGCACCTCTGGTAGAAAGACCCAAGGTCCGGGACCTACGGTCC 

3 721 GACTCCCCGCAGGTTGCCCCGCCTGCCCCAGCGCTACTGGCAAATGCGGCCCCTGTTTCT 
CTGAGGGGCGTCCAACGGGGCGGACGGGGTCGCGATGACCGTTTACGCCGGGGACAAAGA 

3 781 GGAGCTGCTTGGGAACCACGCGCAGTGCCCCTACGGGGTGCTCCTCAAGACGCACTGCCC 
CCTCGACGAACCCTTGGTGCGCGTCACGGGGATGCCCCACGAGGAGTTCTGCGTGACGGG 

3 841 GCTGCGAGCTGCGGTCACCCCAGCAGCCGGTGTCTGTGCCCGGGAGAAGCCCCAGGGCTC 
CGACGCTCGACGCCAGTGGGGTCGTCGGCCACAGACACGGGCCCTCTTCGGGGTCCCGAG 

3 901 TGTGGCGGCCCCCGAGGAGGAGGACACAGACCCCCGTCGCCTGGTGCAGCTGCTCCGCCA 
ACACCGCCGGGGGCTCCTCGTCCTGTGTCTGGGGGCAGCGGACCACGTCGACGAGGCGGT 

3 961 GCACAGCAGCCCCTGGCAGGTGTACGGCTTCGTGCGGGCCTGCCTGCGCCGGCTGGTGCC 

CGTGTCGTCGGGGACCGTCCACATGCCGAAGCACGCCCGGACGGACGCGGCCGACCACGG 

4 021 CCCAGGCCTCTGGGGCTCCAGGCACAACGAACGCCGCTTCCTCAGGAACACCAAGAAGTT 

GGGTCCGGAGACCCCGAGGTCCGTGTTGCTTGCGGCGAAGGAGTCCTTGTGGTTCTTCAA 



4 081 CATCTCCCTGGGGAAGCATGCCAAGCTCTCGCTGCAGGAGCTGACGTGGAAGATGAGCGT 
GTAGAGGGACCCCTTCGTACGGTTCGAGAGCGACGTCCTCGACTGCACCTTCTACTCGCA 



**************************** 
4141 GCGGGACTGCGCTTGGCTGCGCAGGAGCCCAGGTGAGGAGGTGGTGGCCGTCGAGGGCCC 
CGCCGTGACGCGAACCGACGCGTCCTCGGGTCCACTCCTCCACCACCGGCAGCTCCCGGG 

Intron2 

************************************** 

42 01 AGGCCCCAGAGCTGAATGCAGTAGGGGCTCAGAAAAGGGGGCAGGCAGAGCCCTGGTCCT 
TCCGGGGTCTCGACTTACGTCATCCCCGAGTCTTTTCCCCCGTCCGTCTCGGGACCAGGA 



************************************************************ 

4261 CCTGTCTCCATCGTCACGTGGGCACACGTGGCTTTTCGCTCAGGACGTCGAGTGGACACG 
GGACAGAGGTAGCAGTGCACCCGTGTGCACCGAAAAGCGAGTCCTGCAGCTCACCTGTGC 



*****> 

4321 GTGATCGAGGTCGAC 
CACTAGCTCCAGCTG 
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gccaagttcctgcactggctgatgagtgtgtacgtcgtcgagctgctcaggtctttcttt 
tatgtcacggagaccacgtttcaaaagaacaggctctttttctaccggaagagtgtctgg 
agcaagttgcaaagcattggaatcagacagcacttgaagagggtgcagctgcgggacgtg 
tcggaagcagaggtcaggcagcatcgggaagccaggcccgccctgctgacgtccagactc 
cgcttcatccccaagcctgacgggctgcggccgattgtgaacatggactacgtcgtggga 
gccagaacgttccgcagagaaaagagggccgagcgtctcacctcgagggtgaaggcactg 
ttcagcgtgctcaactacgagcgggcgcg 
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TCTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGAG 
ACCAGCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCC 
AGCAGTGGCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATC 
AGGGGCAAGTC 
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1 CCCCAAAACC CCAAAACCCC AAAACCCCTA TAAAAAAAGA AAAAATTGAG 

51 GTAGTTTAGA AATAAAATAT TATTCCCGCA CAAATGGAGA TGGATATTGA 

101 TTTGGATGAT ATAGAAAATT TACTTCCTAA TACATTCAAC AAGTATAGCA 

151 GCTCTTGTAG TGACAAGAAA GGATGCAAAA CATTGAAATC TGGCTCGAAA 

2 01 TCGCCTTCAT TGACTATTCC AAAGTTGCAA AAAC AAT TAG AGTTCTACTT 

2 51 CTCGGATGCA AATCTTTATA ACGATTCTTT CTTGAGAAAA TTAGTTTTAA 

3 01 AAAGCGGAGA GCAAAGAGTA GAAATTGAAA CATTACTAAT GTTTAAATAA 

3 51 AATCAGGTAA TGAGGATTAT TCTATTTTTT AGATCACTTC TTAAGGAGCA 

4 01 TTATGGAGAA AATTACTTAA TACTAAAAGG TAAACAGTTT GGATTATTTC 

4 51 CCTAGCCAAC AATGATGAGT ATATTAAATT CATATGAGAA TGAGTCAAAG 

5 01 GATCTCGATA CATCAGACTT ACCAAAGACA AACTCGCTAT AAAACGCAAG 

5 51 AAAAAGTTTG ATAATCGAAC AGCAGAAGAA CTTATTGCAT TTACTATTCG 

6 01 TATGGGTTTT ATTACAATTG TTTTAGGTAT CGACGGTGAA CTCCCGAGTC 
651 TTGAGACAAT TGAAAAAGCT GTTTACAACT GAAGGAATCG CAGTTCTGAA 

7 01 AGTTCTGATG TGTATGCCAT TATTTTGTGA ATTAATCTCA AATATCTTAT 
751 CTCAATTTAA TGGATAGCTA TAGAAACAAA CCAAATAAAC CATGCAAGTT 

8 01 TAATGGAATA TACGTTAAAT CCTTTGGGAC AAATGCACAC TGAATTTATA 

8 51 TTGGATTCTT AAAGCATAGA TACACAGAAT G C T T TAG AG A CTGATTTAGC 

9 01 TTACAACAGA TTACCTGTTT TGATTACTCT TGCTCATCTC TTATATCTTT 
951 AAAAGAAGCA GGCGAAATGA AAAGAAGACT AAAGAAAGAG ATTTCAAAAT 

10 01 TTGTTGATTC TTCTGTAACC GGAATTAACA ACAAGAATAT TAGCAACGAA 

1051 AAAGAAGAAG AGCTATCACA ATCCTGATTC TTAAAGATTT C AAAAATT C C 

1101 AGGTAAGAGA GAT AC ATT C A TTAAAATTCA TATATTATAG TTTTTCATTT 

1151 CACAGCTGTT ATTTTCTTTT ATCTTAACAA TATTTTTTGA TTAGCTGGAA 

12 01 GTAAAAAGTA TCAAATAAGA GAAGCGCTAG ACTGAGGTAA CTTAGCTTAT 

12 51 TCACATTCAT AGATCGACCT T CAT AT AT C C AATACGATGA TAAGGAAACA 

13 01 GCAGTCATCC GTTTTAAAAA TAGTGCTATG AGGACTAAAT TTTTAGAGTC 

13 51 AAGAAATGGA GCCGAAATCT T AAT C A A A A A GAATTGCGTC GATATTGCAA 

14 01 AAGAATCGAA CTCTAAATCT TTCGTTAATA AGTATTACCA ATCTTGATTG 

14 51 ATTGAAGAGA TTGACGAGGC AACTGCACAG AAGATCATTA AAGAAATAAA 

15 01 GTAACTTTTA TTAATTAGAG AATAAAC TAA ATTACTAATA TAGAGATCAG 

15 51 CGATCTTCAA TTGACGAAAT AAAAGCTGAA CTAAAGTTAG ACAATAAAAA 

16 01 ATACAAACCT TGGTCAAAAT ATTGAGGAAG GAAAAGAAGA CCAGTTAGCA 
1651 AAAGAAAAAA TAAGGCAATA AATAAAATGA GTACAGAAGT GAAGAAATAA 
1701 AAGATTTATT TTTTTCAATA ATTTATTGAA AAGAGGGGTT TTGGGGTTTT 
1751 GGGGTTTTGG GG 
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C C C C AAAAC C C C AAAAC C C C AAAAC C C CT ATAAAAAAAGAAAAAAT T GAGGT AGTTT AGA 

1 + + + + + + 60 

GGGGTTTTGGGGTTTTGGGGTTTTGGGGATATTTTTTTC TTTTTTAAC TC C ATC AAATC T 

a P Q N PKT PKPL * KKKKL R * F R 

b PKTPKPQNPYKKRKN* GSLE - 

c PKPQNPKTPIKKEKI E V V * K - 

AATAAAATATTATTCCCGCACAAATGGAGATGGATATTGATTTGGATGATATAGAAAATT 

61 + + - + + + 4- 120 

TTATTTTATAATAAGGGCGTGTTTACCTCTACCTATAACTAAACCTACTATATCTTTTAA 

a NKILFPHKWRWILIWMI*KI 

b I KYY S RTNGDG Y * FG * YRKF 

c * N I I PAQMEMDIDLD'DIENL- 

TAGTTCCTAATACATTCAACAAGTATAGCAGCTCTTGTAGTGACAAGAAAGGATGCAAAA 

121 + + + + + + 180 

ATGAAGGATTATGTAAGTTGTTCATATCGTCGAGAACATCACTGTTCTTTCCTACGTTTT 

a Y F L I H S T S I AALVVT RKDA K 

b TS *YIQQV*QLL * CQERMQN- 

c LPNTFNKYSSSCSD KKGCKT- 

CATTGAAATCTGGCTCGAAATCGCCTTCATTGACTATTCCAAAGTTGCAAAAACAATTAG 

181 + + + + + + 240 

GTAACTTTAGACCGAGCTTTAGCGGAAGTAACTGATAAGGTTTCAACGTTTTTGTTAATC 

a H*NLARNRLH*LFQSCKNN* 

b I E IWLE I AF I DY S KVAKT I R - 

C IjKSGSKSPSLTI PK-LQKQLE- 

AGTTCTACTTCTCGGATGCAAATCTTTATAACGATTCTTTCTTGAGAAAATTAGTTTTAA 

241 + + + + 4- + 300 

TC AAGATGAAG AGC C TAG GTTTAGAAATATTGC TAAGAAAGAAC TC TTTTAATC AAAATT 

a SSTSRMQIFITILS*EN*F* 

b VLLLGCKSL*RF FLEKISFK- 

c FYFSDANL YNDSFLR KLVLK- 

AAAGCGGAGAGCAAAGAGTAGAAATTGAAACATTACTAATGTTTAAATAAAATCAGGTAA 

301 + + + 4- 4- 4- 360 

TTTCGCCTCTCGTTTCTCATCTTTAACTTTGTAATGATTACAAATTTATTTTAGTCCATT 

a KAESKE *KLKHY*CLNKIR* 

b KRRAKSRN*NITN'V*IKSGN- 

c SGEQRVEI ETLLMFK*NQVM- 

TGAGGATTATTCTATTTTTTAGATCACTTCTTAAGGAGCATTATGGAGAAAATTACTTAA 

AC TC C T AATAAGATAAAAAATC TAGTGAAGAATTC C TC GTAATAC C TC TTTTAATGAATT 

a *GLFYFLDHFLRSIMEKIT"* 

b EDYS IF *ITS*GALWRKLLN- 

C RIILFFR SLLKEHYGENYLI- 
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TACTAAAAGGTAAACAGTTTGGAT^^ 

421 — .-~™~ — + + + + 480 

ATGATTTTCC ATTTGl/C AAACC TAATAAAGGGATCGGTTGTTACTAC TC ATATAATTTAA 

a Y*KVNSLDYFPSQQ**VY*X 

b TKR*TVWX X SLA MHDEYIKF™ 

C LKGKQFGLFP * PTM MS 1 LMS- 

CATATGAGAATGAGTCAAAGGATCTCGATACATCAGACTTACCAAAGAGAAACTCGCTAT 

481 + + +--.---. + ■ — + 540 

GTATAC TCTTACTCAGTTTCC TAGAGCTATGTAGTCTGAATGGTTTC TGTTTGAGC GATA 

a KMRMSQRXSIHQTYQRQTRY 

b X * E * V K G S RY X R LT K D K L A X 

C YENE.SKDLDTSDLPKTNSL*- 

AAAACGCAAGAAAAAGTrTGA 

541 -f + 4- +■ — - — + 600 

TTTTGCGTTCTTTTTCi\AACTATTAGCTTGTCGTCTTCTTGAATAACGTAAATGATAAGC 

a K T Q B K V * * SHS R RTYC XYYS 

b KRKKKFDNRTAEELIAFTIR- 

c NARKSLII EQQ-KNLLHL.riFV- 

TATGGGTTOTATTACAATC 

ATACCC AAAATAATGTTAAC AAAATC CATAGCTGC G ACTTGAGGGCTC AG AAC TCTGTT A 

a YGFYYNCFRYRR*TPES-*BN 

b MGFI TIVI/GIDGELPSL-ETI- 

c WVX.LQLF*VSTV NSRVLR QL- 

TGAAAAAGCTGTTTACAACTGAAGGAATCGCAGTTCTGAAAGTTCTGATGTGTATGCCAT 

661 — - + — — + — — + - — + 720 

ACTTOTTCGACAAATGTTGACTTCCTTAGCGTCAAGACTTTCAAGACTACACATACGGTA 

a * KSCIrfQLKESQF * KFCCVCK 

b. E K A V Y N * R N R $ S E ■ S SDV YA X 

C K K ■ L F T T B G X A V X, K V L M C M F L - 

TATTTTGTGAATTAATCTCAAATATCTTATCTCAATTTAATGGATAGCTATAGAAACAAA 

72.1 4> — ---- : + 4. 7BQ 

ATAAAACACTTAATTAGAGTTTATAGAATAGAGTTAAATTACCTATCGATATCTTTGTTT 

a Y F V ■ N * S Q X S Y L M L M D S Y R N K 

b XL*INLKYL IS I*WIAXETN- 

C F C E L X S N X L S Q F N G * h * K Q T - 

C CAAATAAACC ATGC AAGTTTAATGGAATATACGTTAAATCC TTTGGGAC AAATGCAG AC 

731 + + 4- + „ + S40 

GGTTTATTOGGTACCTTCAAATmCCTTATArGCAATTTAGGAAACCCTGTTTACGTGTG 

a PHKPCKFNGIYVKSFGT WTA H 

b QINHASLMEYTLNPLGQMHT- 

C K * T M Q V * W N X R * X h W D K C T L - 

TGAATTTATATTGGATTCTTAAAGCATAGATACACAGAATGCTTTAGAGACTGATTTAGC 

841 4- 4- + + „ — 4- + 900 

AC TTAAAT ATAACCTAAGAATTTCGTATC TATGTGTCTTACGAAATCTCTGAC TAAATC G 

a *IYIGFLKHRYT'ECFRD*FS 

fo EFXLDS*SIDTQKTALETDLA- 

c NLY.WILKA* X K R M I* * RLX * L ~~ 
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TTACAACAGATTACCTGTTTTGATTACTCTTGCTCATCTCTTATATCTTTAAAAGAAGCA 

901 + + + + + + 960 

AATGTTGTC TAATGGAC AAAAC TAATGAGAAC GAGTAGAGAATATAGAAATTTTC TTC GT 

a LQQITCFDYSCSSLiISLKEA 
b YNRLPVLITLAHLLYL*KKQ- 
c TTDYLF*LLLL IS YIFKRSR- 

GGC GAAATGAAAAGAAGAC TAAAGAAAGAGATTTC AAAATTTGTTGATTC TTC TGTAAC C 

961 + + + + + + 1020 

CCGCTTTACTTTTCTTCTGATTTCTTTCTCTAAAGTTTTAAACAACTAAGAAGACATTGG 

a GEMKRRLKK EI SKFVDSSVT 
b AK*KED*RKRFQNLLILL* P - 
c RNEKKTKERDFKIC *FFCNR- 

GGAATTAAC AAC AAGAATATTAGC AAC GAAAAAGAAGAAGAGCTATC AC AATC C TGATTC 

1021 + + + + + + 1080 

CCTTAATTGTTGTTCTTATAATCGTTGCTTTTTCTTCTTCTCGATAGTGTTAGGACTAAG 

a GINNKNISNEKEEELSQS*F 
b ELTTRILATKKKKSYHNPDS- 
C N*QQEY*QRKRRRAITILIL- KJ 

TTAAAGATTTCAAAAATTCCAGGTAAGAGAGATACATTCATTAAAATTCATATATTATAG 

1081 + + + + + + 1140 

AATTTCTAAAGTTTTTAAGGTCCATTCTCTCTATGTAAGTAATTTTAAGTATATAATATC 

a LKISKIPGKRDTFIK IHIL* 
b *RFQKFQVREIHSLKFIYYS~ 
c KD F KNS.R * ERY I H * NS Y I I V - 

TTTTTCATTTCACAGCTGTTATTTTCTTTTATCTTAACAATATTTTTTGATTAGCTGGAA 

1141 + + + + - + + ' 1200 

AAAAAGTAAAGTGTC GAC AATAAAAGAAAATAGAATTGTTAT AAAAAAC TAATC GAC C TT 

a FFISQLLFSFILTIFFD*LE 
b F S FH S C YFL LS * QYFLI SWK- 

c FH F T AV I F F Y L NN I F * LAG S - 

GTAAAAAGT ATC AAATAAGAGAAGC GC TAGAC TG AGGTAAC TT AGC TTATTC AC ATTC AT 

1201 + + + + + + 1260 

CATTTTTCATAGTTTATTCTCTTCGCGATCTGACTCCATTGAATCGAATAAGTGTAAGTA 

a VKSIK*EKR*TEVT *LIHIH 
b * KVSNKR SARLR *LSLFTF I - 

c KKYQ I REALD * GNL AYS H S * - 

AGATCGACCTTCATATATCCAATACGATGATAAGGAAACAGCAGTCATCCGTTTTAAAAA 

1261 + + + + + + 1320 

TCTAGCTGGAAGTATATAGGTTATGCTACTATTCCTTTGTCGTCAGTAGGCAAAATTTTT 

a RSTFIYPIR**GNSSHPF*K 
b DRPSYIQYD'DKETAVIRFKN- 
c IDLHISNTMIRKQQ. SSVLKI- 

TAGTGCTATGAGGACTAAATTTTTAGAGTCAAGAAATGGAGCCGAAATCTTAATCAAAAA 

1321 + + + + + + 1380 

ATCACGATACTCCTGATTTAAAAATCTCAGTTCTTTACCTCGGCTTTAGAATTAGTTTTT 

a * CYED * IFRVKKWS RNLNQK 
b SAMRTKFLESRN GAEILIKK- 

C VL * GLNF * SQEMEPKS * SKR- 

F/G. 35 
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GAATTGCGTCGATATTGCAAAAGAATCGAACTCTAAATCTTTCGTTAATAAGTATTACCA- 

1331 + + + + + + 1440 

C TT AAC GC AGC TAT AAC GTTTTC TT AGC TTGAG ATTT AGAAAGC AATT ATT C AT AAT GGT 

a ELRRYCKRIEL * I F R * * V L P 

b NCVD IAKESNSKS FVNKYYQ- 

G IAS ILQKNRTL.NL S L I S I T N - 

ATCTTGATTGATTGAAGAGATTGACGAGGCAACTGCACAGAAGATCATTAAAGAAATAAA 

1441 + + + + + + 1500 

TAGAACTAACTAACTTCTCTAACTGCTCCGTTGACGTGTCTTCTAGTAATTTCTTTATTT 

a ILIDCRDCRGNCTEDH*RNK 

b S * L I E E I D EA T A Q K I I KE I. K - 

c LD * LKRLTRQLHRRSLKK* S - 

GTAAC TTTT ATTAATTAGAGAATAAAC TAAATTAC TAATATAGAG ATC AGC GATC TTC AA 

1501 + + + + + + 1560 

CATTGAAAATAATTAATCTCTTATTTGATTTAATGATTATATCTCTAGTCGCTAGAAGTT 

a VTFIN*RIN*ITNIEISDLQ 

b *LLLIRE*TKLLI*RSAIFN- 

C NFY * L EN KLNY * YRDQR S S I - 

TTGAC GAAAT AAAAGC TGAAC TAAAGTTAGAC AATAAAAAATAC AAAC CTTGGTC AAAAT 

1561 + + + + + + 1620 

AACTGCTTTATTTTCGACTTGATTTCAATCTGTTATTTTTTATGTTTGGAACCAGTTTTA 

a LTK* KLN* S *TIKNTNLGQN 

b *RNKS*TKVRQ*KIQTLV"KI- 

c DE IKAELKLDNKKYKPWSKY- 

ATTGAGGAAGGAAAAGAAGACCAGTTAGCAAAAGAAAAAATAAGGCAATAAATAAAATGA 

1621 + + + - + + + 1680 

TAACTCCTTCCTTTTCTTCTGGTCAATCGTTTTCTTTTTTATTCCGTTATTTATTTTACT 

a IEEG 'KEDQLAKE KIRQ* IK* 

b LRKEKKTS *QKKK* GNK*NE- 

C * GRKRRPVSKRKNKAINKMS- 

GTACAGAAGTGAAGAAATAAAAGATTTATTTTTTTCAATAATTTATTGAAAAGAGGGGTT 

1681 + + + + + + 1740 

CATGTCTTCACTTCTTTATTTTCTAAATAAAAAAAGTTATTAAATAACTTTTCTCGCCAA 

a VQK * RNKRF I FFNNLLKRGV 

b YRSEEIKDLFFSI IY*KEGF- 

C TEVKK * K IYFFQ * F I E. K R G F - 

TTGGGGTTTTGGGGTTTTGGGG 

1741 + + 1762 

AACCCCAAAACCCCAAAACCCC 

aL GF WGFG 
b WGFGVLG 
c G V L G F W 
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2 E VDVQNQADNHG I HS ALKTCEE I KEAKTLYS WI QKVI RCRNQ SQSH YKDL 51 
1^= i :.: = !: =1 |-: = | : = ■ • i |..|-|.| 

19 ELELEMQENQNDIQVRVK . . . . IDDPKQY . . LVNVTAACLLQEGSYYQDK 62 

52 EDIKIFAQTNIVATPRDYNEEDFKVIARKEVF. STGLMIELIDKCLVELL 100 

63 DERRYI ITKALL . . . . EVAESDPEFICQLAVYIRNELYIRTTTNYIVAF . 107 

101 SSSDVSDRQKLQCFGFQLKGNQLAKTHLLTALSTQKQYFFQDEWNQVRAM 15 0 

108 CVVHKNTQPFIEKYFNKAVLLPNDLLEVCEFAQVLYI 144 

151 IGNELFRHLYTKYLIFQRTSEGTLVQFCGNNVFDHLKVNDKFDKKQKGGA 2 0 0 

145 FDATEFKNLY LDRILSQDIRKELTFRKCLQRCVRSKF 181 

201 ADMNE. . .PRCCSTCKYNVKNEKDHFLNNINVPNWNNMKSRTRIFYCTHF 247 

. = :|| - = = |..= . |... ::|. .| ..|:. |.: 
182 SEFNEYQLGKYCTES . . QRKKTMFRYLSVTNKQKWDQTKKK 220 

24 8 NRNNQFFKKHEFVSNKNNISAMDRAQTIFTNIFRFNRIRKKLKDKVIEKI 2 97 

|.= -..| | : :|:..: | ||. |:.|| 

221 . RKENLLTKLQAIKESEDKSKRETG DIMNVEDAIKALKPAVMKKI 264 

2 98 AYMLEKVKDFNFNYYLTKSCPLPENWRERKQKIENLINKTREEKSKYYEE 34 7 

I * * : l :. | ..: |.|'|.| 

2 65 AKRQNAMK KHMKAPKIPNSTLESKYLTFKD 2 94 

34 8 LFSYTTDNKCVTQFINEFFYNILPKDFLTGRNRKNFQKKVKKYVELNKHE 3 97 

2 95 LIKFCHISEP KERVYKILGKKYPKTEEEYKAAFGDSASAPFM . PE 33 8 

3 98 L IHKNLLLE KINTRE I S WMQVETS AKHF YYFDHENI YVLWKLLRWI FEDL 44 7 

I :| -hi ■ =..|. : : .| ... :| | : : 

339 LAGKRMK I E I S KTWENELS AKGNTAE VWDNL I S SNQL P YMAMLRNLSN . . 386 

44 8 VVSLIRCFFYVTEQQKSYSKTYYYRKNIWDVIMKMSIADLKKETLAEVQE 4 97 

387 ILKAGVSD 3 94 

4 98 KEVEEWKKSLGFAPGKLRLIPKKTTFRPIMTFNKKIVNSDRKTTKLTTNT 54 7 

395 TTHS 398 

54 8 KLLNSHLMLKTLKNRMFKDPFGFAVFNYDDVMKKYEEFVCKWKQVGQPKL 597 
I :|. |.|:..|: 

3 99 IVINK I CE PKAVENS KM 415 

598 FFATMDIEKCYDSVNREKLSTFLKTTKLLSSDFWIMTAQILKRKNNIVID 64 7 

I -II I :|..| -= :.:|| |: |:: 

416 F . . PLQFFSAIEAVN . EAVTKGFKAKK. . . RENMNLKGQIEAVKE . .WE 457 

64 8 SKNFRKKEMKDYFRQKFQKIALEGGQYPTLFSVLENEQNDLNAKKTLIVE 697 

Ihl .||.-= .-:..| |. .| 

458 KTDEEKKDM ELEQTEEGEFVKVNEGIGKQYINS IELAIK 496 

6 98 AKQRNYFKKDNLLQPVINICQYNYINFNGKFYKQTKGI PQGLCVSSILSS 74 7 

4 97 I AVNKNLDE I KGHTAI FSDVSGSMSTSMSGGAKKYGS VRTCLECALVLGL 54 6 
74 8 FYYATLEESSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFI 797 
54 7 MVKQRCEKSSFYIFSSPSSQCNKCYLEVDL 576 
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798 EKLINVSRENGFKFNMKK. LQTSFPLSPSKFAKYGMDSVEEQNIVQDYCD 84 6 

577 PGDELRPSMQKLLQEKGKLGGG . . TDFPYECIDEWTKNKTHVD 617 

847 WIGISI DMKTLALMPN INLR I EG I LCTLNLNMQTKKASMWLKKKLKS FLM 8 9 6 

I * I II-. - -Ihl .:|= ||. |. : 

618 NIVILSDMMIAEGYSDINVRGSSIVNSI KKYKDEVN 653 

8 97 NNITHYFRKTITTEDFANKTLNKLFISGGYKYMQCAKEYKD . HFKKNLAM 94 5 

II- : h:: | : : . : . | : . : : : | : : | 

654 PNIKIF. . . AVDLEGYG KCLNLGDE FNENNY I K I FGM 687 

94 6 SSMIDLEVSKIIYSVTRAFFKYLVCNIKDTIFGEEHYPDFFLSTLKHFIE 9 95 

I- I :| = = :::| 
688 SDSI LKF I SAKQGGA NMVE 706 

996 IFSTKKYIFNRVC 1008 

707 VI . . KNFALQKIG 717 

FIG. 36 

(CONTINUED) 

13 2 LSTQKQYFFQDEWNQVRAMIGNEL . FRHLYTKYLIFQRTSE . . GTLVQFC 178 
= 1 --I ..--I I I I I : - | | | . 

1 MSRRNQ KKPQAP I GNETNLDFVLQNLE VYKSQ I EHYKTQQQQ I 4 3 

17 9 GNNVFDHLKVNDKFDKKQKGGAADMNEPRCCSTCKYNVKNEKDHFLNNIN 2 2 8 
- :. II..:. ..|...| :| | .|..:.:|..:| 

4 4 KEEDLKLLKFKNQDQDGNSGNDDDDEE NNSNKQQELLRRVN 84 

22 9 VPNWNNMKSRTRIFYCTHFNRNNQFFKKHEFVSNKNNISAMDRAQTIFTN 278 

h:|| h-| = . -| 

85 Q I KQQVQL I KK . . . VGSKVEKDLNLNEDENKKN 114 

2 7 9 IFRFNRIRKKLKDKVIEKI AYMLEKVKDFNFNYYLTKSCPLPENWRERKQ 3 2 8 

115 GLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDY 164 

32 9 KIENLINKTREEKSKYYEELFSYTTDNKCVTQFINE. FFYNILPKDFLTG 3 77 

••I-::: -:::| .:|. : |: | : ||::. 

16 5 DTEKWFEI SHDQK NYVS I YANQKTS YCWWLKDYFNK 2 0 0 

3 78 RNRKNFQKKVKKYVELNKHELIHKNLLLEKINTREISWMQVETSAKHFYY 4 2 7 

•I |:. : = :.|. : |= |=.. 
201 NNYDHLNVSINRLE. . TEAEFYAFDDFSQTIKLTNNSYQTVNID 242 

428 FDHENIYVLWKLLRWI . . FEDLWSLIRCFFYVTEQQKSYSKTYYYRKNI 475 

- :| = = ||| = : :| = = lh : I -| . = -|. . ..| 
243 VNFDNNLCILALLRFLLSLERFNILNIRSSY. . TRNQYNFEKIGELLETI 2 90 

4 76 WDVIMKMSIADLKKETLAEVQEKEVEEWKKSLGFAPGKLRLIPKKTTFRP 52 5 

2 91 FAWFSHR HLQGIHLQVPCEAFQYLVNSSSQISVKDSQLQ 3 3 0 

52 6 IMTFNKKIVNSDRKTTKLTTNTKLLNSHLMLKTLKNRMFKDPFGFAVFNY 5 75 

: I-.-: I |.|: • |:|.. -|. :. . . || |. 

331 VYSFSTDLKLVD. . TNKVQDYFKFLQEFPRLTHVSQQAI PVSATNAVENL 378 
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576 DDVMKKYEEFVCKWKQVGQPKLF FATMDIEKCYDS . . VNREK 615 

= - = l I =. f h | :.:| :.::| 

3 79 NVLLKKVKH ANLNLVSI PTQFNFDFYFWLQHLKLEFGLEPNILTKQK 42 6 

516 LSTFL KTTKLLSSDFWIMTAQILKRKNNI . . VIDSKNFRKKEMK 657 

I- hi h. || ||. : ... || = = ... • 

427 LENLLLS I KQS KNLKFLRLNF YTYVAQETSRKQ I LKQATT I KNLKNNKNQ 4 76 

5 5 8 DYFRQKFQKIALEGGQYPTLFSVLEN . . EQNDLNAKKTLI VEAKQRNYFK 7 0 5 

477 EETPETKDETPSESTSGMKFFDHLSELTELEDFSVN LQATQEIY. . 520 

7 06 KDNLLQPVINICQYNYINFNGKFYKQTKGIPQGLCVSSILSSFYYATLEE 755 

I i I -I | |.| |: 

521 .DSLHKLLIRSTNLKKFKLSYKYEMEKSKMDTFIDLKNI YETLNN 564 

756 SSLGFLRDESMNPENPNVNLLMRLTDDYLLITTQENNAVLFIEKLINVSR 3 05 
1= I I -Ih I lh ||. 

5 6 5 LKRCSVNI SNPHGNI SYELTN KDSTFYKFKLTLNQE 5 0 0 

8 06 ENGFKFNMKKLQTSFPLSPSKFAKYGMDSVEEQNIVQDYCDWIGISIDMK 8 55 

h-l |-.|.:. I ||- .| = |. : ::..|--|: : ::. 

601 LQHAKYTFK. . QNEFQFNNVKSAKIESSSLESLEDIDSLCKSIASCKNLQ 64 8 

856 TLALMPNINLRIEGILCTLNLNMQT. . KKASMWLKK . . KLKSFLMNNITH 9 01 

I h---. h- :::|. .||.: .|. 

649 NVKFI IASLLYPNNIQKNPFNKPNLLFFKQFEQLKNLENVSINC 691 

9 02 YFRKTI . . . T T E D FAN KTLNKL F I S GG Y K YMQ C AKE Y KDH F KKNL AM S S M 94 8 

' : * I I I II I -:|: =|.. II- 

6 92 ILDQHILNS I SEFLEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPEL 741 

949 IDLEVSKIIYSVT RAFFKYLVCNIKDT . . IFGEEHY 982 

742 NQVYINQQLEELTVSEVHKQVWENHKQKAFYEPLCEFIKESSQTLQLIDF 791 

983 PDFFLS TLKHFIE1FSTKKY I FNRVCM I LKAKE AKL KS DQ CQ S L I Q 1028 

- =1 -I I =|..|| . |: ... | ..::.|.|:. 

792 DQNTVSDDS IKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQELLK 84 0 

FIG. 37 

(CONTINUED) 

4 DIDLDDIENLLPNTFNKYSSSCSDKKGCKTLKSGSKSPSLTIPK 4 7 
: = ■ '-lh :.: = ... | |.. :||.|.... :|| .|. 

617 NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNIIASLLYPNNIQKNP 6 66 

48 LQKQLEFYFSDANLYNDSFLRKLVLKSGEQRVE. . . . IETLLM 86 

: Ihl - • -h :: = | :|.| :| |..::: 

66 7 FNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEFLEKNKKIKAFIL 716 

FIG. 38 
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telomerase p43 
human La 

Xenopus LaA I 
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S. c. Lhplp 
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ICH2UEYYFGDFNLPR DKFLKEQI . KLDEGWVPLEIMIK 
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ILR2VEYYFGDANLNR3KFLREQIGKNEDGWVPLSVLVT 
CLKQVEFYFSEFISJFPY DRFLRTTAEK . NDGWVPISTIAT 



FIG. 41 



1 aactcattta attactaatt taatcaacaa 
61 tagatttaat ttagaaagta tcaattgaaa 
121 gccaaaagcc gaaaaattgt ggtgggaact 
181 tatataagtt agggttaaga ttgacgatcc 
241 atgtttgttg taggaaggta gttactacta 
301 taaagcactt cttgaggtgg ctgagtctga 
361 catccgtaat gaactttaca tcagaactac 
421 ccacaagaat actcaaccat tcatcgaaaa 
481 tgacttactg gaagtctgtg aatttgcata 
541 caaaaatttg tatcttgata ggatactttc 
601 taagtgttta caaagatgcg tcagaagcaa 
661 taagtattgc actgaatcct aacgtaagaa 
721 caagtaaaag tgggattaaa ctaagaagaa 
781 ggcaataaag gaatctgaag ataagtccaa 
841 agatgcaatc aaggctttaa aaccagcagt 
901 catgaagaaa cacatgaagg cacctaaaat 
961 gaccttcaag gatctcatta agttctgcca 
1021 gatccttggt aaaaaatacc ctaagaccga 
1081 tgcatctgca cccttcaatc ctgaattggc 
1141 aacatgggaa aatgaactca gtgcaaaagg 
1201 ttcaagcaat taactcccat atatggccat 
1261 cggtgtttca gatactacac actctattgt 
1321 tgagaactcc aagatgttcc ctcttcaatt 
1381 agttactaag ggattcaagg ccaagaagag 
1441 agcagtaaag gaagttgttg aaaaaaccga 
1501 aaccgaagaa ggagaatttg ttaaagtcaa 
1561 cattgaactt gcaatcaaga tagcagttaa 
1621 tgcaatcttc tctgatgttt ctggttctat 
1681 gtatggttcc gttcgtactt gtctcgagtg 
1741 acgttgtgaa aagtcctcat tctacatctt 
1801 ttacttagaa gttgatctcc ctggagacga 
1861 agagaaagga aaacttggtg gtggtactga 
1921 aaagaataaa actcacgtag acaatatcgt 
1981 atattcagat atcaatgtta gaggcagttc 
2041 tgaagtaaat cctaacatta aaatctttgc 
2101 taatctaggt gatgagttca atgaaaacaa 
2161 aatcttaaag ttcatttcag ccaagcaagg 
2221 ctttgccctt caaaaaatag gacaaaagtg 
2281 ctcaccccac ttttttgttt tattgcatag 
2341 atttaagtta cttacatagt ttatgtatcg 
2401 aaagaacaaa aaagattaaa a 



gattgataaa aagcagtaaa taaaacccaa 
aatggaaatt gaaaacaact aagcacaata 
tgaattagag atgcaagaaa accaaaatga 
taagcaatat ctcgtgaacg tcactgcagc 
agataaagat gaaagaagat atatcatcac 
tcctgagttc atctgctagt tggcagtcta 
cactaactac attgtagcat tttgtgttgt 
gtacttcaac aaagcagtac ttttgcctaa 
ggttctctat atttttgatg caactgaatt 
ataagatatt cgtaaggaac tcactttccg 
gttttctgaa ttcaacgaat actaacttgg 
aacaatgttc cgttacctct cagttaccaa 
gagaaaagag aatctcttaa ccaaacttta 
gagagaaact ggagacataa tgaacgttga 
tatgaagaaa atagccaaga gatagaatgc 
tcctaactct accttggaat caaagtactt 
tatttctgag cctaaagaaa gagtctataa 
agaggaatac aaagcagcct ttggtgattc 
tggaaagcgt atgaagattg aaatctctaa 
caacactgct gaggtttggg ataatttaat 
gttacgtaac ttgtctaaca tcttaaaagc 
gatcaacaag atttgtgagc ccaaggccgt 
ctttagtgcc attgaagctg ttaatgaagc 
agaaaatatg aatcttaaag gtcaaatcga 
tgaagagaag aaagatatgg agttggagta 
cgaaggaatt ggcaagcaat acattaactc 
caagaattta gatgaaatca aaggacacac 
gagtacctca atgtcaggtg gagccaagaa 
tgcattagtc cttggtttga tggtaaaata 
cagttcacct agttctcaat gcaataagtg 
actccgtcct tctatgtaaa aacttttgca 
tttcccctat gagtgcattg atgaatggac 
tattttgtct gatatgatga ttgcagaagg 
cattgttaac agcatcaaaa agtacaagga 
agttgactta gaaggttacg gaaagtgcct 
ctacatcaag atattcggta tgagcgattc 
aggagcaaat atggtcgaag ttatcaaaaa 
agtttcttga gattcttcta taacaaaaat 
ccattatgaa atttaaatta ttatctattt 
cagtctatta gcctattcaa atgattctgc 



FIG. 42 
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ME I ENNQAQ Q PKAE KL WWELE LEMQENQND I Q VR VK I DD P KQ YL 
VNVTAACLLQEGS YYQDKDE RRYI I TKALLE VAE SDPE F I CQLA 
VYIRNELYIRTTTNYIVAFCVVHKNTQPFIEKYFNKAVLjLPNDL 
LEVCEFAQVLYIFDATEFKNLYLDRILSQDIRKELTFRKCLQRC 
VRSKFSEFNEYQLGKYCTESQRKKTMFRYLSVTNKQKWDQTKKK 
RKENLLTKLQAIKESEDKSKRETGDIMNVEDAIKALKPAVMKKI 
AKRQNAMKKHMKAPKIPNSTLESKYLTFKDLIKFCHISEPKERV 
YKILGKKYPKTEEEYKAAFGDSASAPFNPELAGKRMKIEISKTW 
ENELSAKGNTAEVWDNLISSNQLPYMAMLRNLSNILKAGVSDTT 
HSIVINKICEPKAVENSKMFPLQFFSAIEAVNEAVTKGFPCAKKR 
ENMNLKGQIEAVKEVVE KTDEEKKDMELEQTEEGEFVKVNEGIG 
KQYINS IELAIKIAVNKNLDEIKGHTAIFSDVSGSMSTSMSGGA 
KKYGSVRTCLECALVLGLMVKQRCEKSSFYIFSSPSSQCNKCYL 
EVDLPGDELRPSMQKLLQEKGKLGGGTDFPYECIDEWTKNKTHV 
DNIVILSDMMIAEGYSDINVRGSSIVNSIKKYKDEVNPNIKIFA 
VDLEGYGKCLNLGDE FNENNY I KI FGMSDS ILKFI S AKQGGANM 
VEVIKNFALQKIGQK 

FIG. 43 

MSRRNQKKPQAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIK 
EEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRVNQIKQ 
QVQL I KKVGS KVEKDLNLNEDENKKNGLSEQQVKEEQLRT I TEE 
QVKYQNLVFNMDYQLDLNESGGHRRHRRETDYDTEKWFE ISHDQ 
KNYVS I YANQKTSYCWWLKDYFNKNNYDHLNVS INRLETEAEFY 
AFDDFSQTIKLTNNSYQTVNIDVNFDNNLCILALLRFLLSLERF 
N I LNT RS S YTRNQYNFE KI GELLET I FAVVFSHRHLQGI HLQVP 
CEAFQYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKF 
LQEFPRLTHVSQQAI PVSATNAVENLNVLLKKVKHANLNLVS I P 
TQFNFDFYFVNLQHLKLEFGLEPNILTKQKLENLLLSIKQSKNL 
KFLRLNFYTYVAQETSRKQILKQATTIKNLKNNKNQEETPETKD 
ETPSESTSGMKFFDHLSELTELEDFSVNLQATQEIYDSLHKLLI 
RSTNLKKFKLSYKYEMEKSKMDTFIDLKNIYETLNNLKRCSVNI 
SNPHGNISYELTNKDSTFYKFKLTLNQELQHAKYTFKQNEFQFN 
NVKSAKIESSSLESLEDIDSLCKSIASCKNLQNVNI IASLLYPN 
NIQKNPFNKPNLLFFKQFEQLKNLENVSINCILDQHILNSISEF 
LEKNKKIKAFILKRYYLLQYYLDYTKLFKTLQQLPELNQVYINQ 
QLEELTVSE VHKQVWENHKQKAFYE PLCE F I KES SQTLQL I DFD 
QNTVSDDSIKKILESISESKYHHYLRLNPSQSSSLIKSENEEIQ 
ELLKACDEKGVLVKAYYKFPLCLPTGTYYDYNSDRW 

F/G. 45 

MKILFEFIQDKLDIDLQTNSTYKENLKCGHFNGLDEILTTCFAL 
PNSRKIALPCLPGDLSHKAVIDHCI IYLLTGELYNNVLTFGYKI 
ARNEDVNNSLFCHSANVNVTLLKGAAWKMFHSLVGTYAFVDLLI 
N YT VI Q FNGQ F FTQ I VGNRCNE PHL P PKW VQR S S S S S AT AAQ I K 
QLTEPVTNKQFLHKLNINSSSFFPYSKILPSSSS IKKLTDLREA 
I FPTNLVKI PQRLKVRINLTLQKLLKRHKRLNYVS ILNS ICPPLt 
EGTVLDLSHLSRQSPKERVLKFIIVILQKLLPQEMFGSKKNKGK 
IIKNLNLLLSLPLNGYLPFDSLLKKLRLKDFRWLFISDIWFTKH 
NFENLNQLAICFISWLFRQIjI PKI IQTFFYCTEISSTVTIVYFR 
HDTWNKLITPFIVEYFKTYLVENWVCRNHNSYTLSNFNHSKMRI 
I PKKSNNEFRI IAI PCRGADEEEFTI YKENHKNAIQPTQKILEY 
LRNKRPTSFTKIYSPTQIADRIKEFKQRLLKKFNNVLPELYFMK 
FDVKSCYDS I PRMECMRILKDALKNENGFFVRSQYFFNTNTGVL 
KL FNWNAS R VPKP YE L Y I DN VRT VHLSNQD V I NVVEME I FKTA 
LWVEDKCYIREDGLFQGSSLSAPIVDLVYDDLLEFYSEFKASPS 
QDTLtlLKLADDFLIISTDQQQVINIKKLAMGGFQKYNAKANRDK 
ILAVSSQSDDDTVIQFCAMHIFVKELEVWKHSSTMNNFHIRSKS 
S KGI FRS L I ALFNTR I S YKT I DTNLNS TNT VLMQ I DH WKN I S E 
CYKSAFKDLS INVTQNMQFHSFLQRI IEMTVSGCPITKCDPLIE 
YEVRFTILNGFLESLSSNTSKFKDNIILLRKEIQHLQAYIYIYI 
HIVN 
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1 tcaatactat taattaataa ataaaaaaaa 
61 ctaaaaaaag ccataggctc ctataggcaa 
121 tctagaagtt tacaaaagcc agattgagca 
181 ggaggatctc aagcttttaa agttcaaaaa 
241 tgatgatgat gaagaaaaca actcaaataa 
301 gattaagtag caagtttaat tgataaaaaa 
361 tttgaacgaa gatgaaaaca aaaagaatgg 
421 attaagaacg attactgaag aataggttaa 
481 ccagttagat ttaaatgaga gtggtggcca 
541 tactgaaaaa tggtttgaaa tatctcatga 
601 ctaaaagaca tcatattgtt ggtggcttaa 
661 tcttaatgta agcattaaca gactagaaac 
721 ttcacaaaca atcaaactta ctaataattc 
781 tgataataat ctctgtatac tcgcattgct 
841 tattttgaat ataagatctt cttatacaag 
901 gctacttgaa actatcttcg cagttgtctt 
961 acaagttcct tgcgaagcgt tctaatattt 
1021 agatagctaa ttataggtat actctttctc 
1081 agtccaagat tattttaagt tcttataaga 
1141 ggctatccca gttagtgcta ctaacgctgt 
1201 caagcatgct aatcttaatt tagtttctat 
1261 tgttaattta taacatttga aattagagtt 
1321 aaagcttgaa aatctacttt tgagtataaa 
1381 aaacttttac acctacgttg cttaagaaac 
1441 aacaatcaaa aatctcaaaa acaataaaaa 
1501 aactccaagc gaaagcacaa gtggtatgaa 
1561 gcttgaagat ttcagcgtta acttgtaagc 
1621 acttttgatt agatcaacaa atttaaagaa 
1681 aaagagtaaa atggatacat tcatagatct 
1741 taaaagatgc tctgttaata tatcaaatcc 
1801 taaagattct actttttata aatttaagct 
1861 gtatactttt aagtagaacg aattttaatt 
1921 ttcctcatta gaaagcttag aagatattga 
1981 aaatttacaa aatgttaata ttatcgccag 
2041 tcctttcaat aagcccaatc ttctattttt 
2101 aaatgtatct atcaactgta ttcttgatca 
2161 agaaaagaat aaaaaaataa aagcattcat 
2221 tcttgattat actaaattat ttaaaacact 
2281 cattaattag caattagaag aattgactgt 
2341 ccacaagcaa aaagctttct atgaaccatt 
2401 cctttagcta atagattttg accaaaacac 
2461 agaatctata tctgagtcta agtatcatca 
2521 cagtttaatt aaatctgaaa acgaagaaat 
2581 aggtgtttta gtaaaagcat actataaatt 
2641 cgattacaat tcagatagat ggtgattaat 
2701 tgaatatttc tttgcttatt atttgaataa 
2761 atatatttta gttatttaat tcattatttt 
2821 aaaaaatcg 



gcaaactaca aagaaaatgt caaggcgtaa 
tgaaacaaat cttgattttg tattacaaaa 
ttataagacc tagtagtaat agatcaaaga 
ttaagattag gatggaaact ctggcaacga 
ataataagaa ttattaagga gagtcaatta 
agttggttct aaggtagaga aagatttgaa 
actttctgaa tagcaagtga aagaagagta 
gtattaaaat ttagtattta acatggacta 
tagaagacac agaagagaaa cagattatga 
ccaaaaaaat tatgtatcaa tttacgccaa 
agattatttt aataaaaaca attatgatca 
tgaagccgaa ttctatgcct ttgatgattt 
ttactagact gttaacatag acgttaattt 
tagattttta ttatcactag aaagattcaa 
aaattaatat aattttgaga aaattggtga 
ttctcatcgc cacttacaag gcattcattt 
agttaactcc tcatcataaa ttagcgttaa 
tacagactta aaattagttg acactaacaa 
attccctcgt ttgactcatg taagctagta 
agagaacctc aatgttttac ttaaaaaggt 
ccctacctaa ttcaattttg atttctactt 
tggattagaa ccaaatattt tgacaaaaca 
ataatcaaaa aatcttaaat ttttaagatt 
ctccagaaaa cagatattaa aacaagctac 
tcaagaagaa actcctgaaa ctaaagatga 
attttttgat catctttctg aattaaccga 
tacccaagaa atttatgata gcttgcacaa 
gttcaaatta agttacaaat atgaaatgga 
taagaatatt tatgaaacct taaacaatct 
tcatggaaac atttcttatg aactgacaaa 
gaccttaaac taagaattat aacacgctaa 
taataacgtt aaaagtgcaa aaattgaatc 
tagtctttgc aaatctattg cttcttgtaa 
tttgctctat cccaacaata tttagaaaaa 
caagcaattt gaataattga aaaatttgga 
gcatatactt aattctattt cagaattctt 
tttgaaaaga tattatttat tacaatatta 
tcaatagtta cctgaattaa attaagttta 
gagtgaagta cataagtaag tatgggaaaa 
atgtgagttt atcaaagaat catcctaaac 
tgtaagtgat gactctatta aaaagatttt 
ttatttgaga ttgaacccta gttaatctag 
ttaagaactt ctcaaagctt gcgacgaaaa 
ccctctatgt ttaccaactg gtacttatta 
taaatattag tttaaataaa tattaaatat 
tacatacaat agtcattttt agtgttttga 
aagtaaataa t tattt ttca atcatttttt 



FIG. 44 
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Oxytricha LCVSYILSSFYYANLEENALQFLRKESMDPEKPETNLLMRLT 
Euplotes LCVSSILSSFYYATLEESSLGFLRDESMNPENPNVNLLMRLT 
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ATTTATACTCATGAAAATCTTATTCGAGTTCATTCAAGACAAGCTTGACATTGATCTACA 

GACCAACAGTACTTACAAAGAAAATTTAAAATGTGGTCACTTCAATGGCCTCGATGAAAT 

TGTAACTACGTGTTTCGCACTACCAAATTCAAGAAAAATAGCATTACCATGCCTTCCTGG 

TGACTTAAGCCACAAAGCAGTCATTGATCACTGCATCATTTACCTGTTGACGGGCGAATT 

ATACAACAACGTACTAACATTTGGCTATAAAATAGCTAGAAATGAAGATGTCAACAATAG 

TCTTTTTTGCCATTCTGCAAATGTTAACGTTACGTTACTGAAAGGCGCTGCTTGGAAAAT 

GTTCCACAGTTTGGTCGGTACATACGCATTCGTTGATTTATTGATCAATTATACAGTAAT 

TCAATTTAATGGGCAGTTTTTCACTCAAATCGTGGGTAACAGATGTAACGAACCTCATCT 

GCCGCCCAAATGGGTCCAACGATCATCCTCATCATCCGCAACTGCTGCGCAAATCAAACA 

ACTTACAGAACCAGTGACAAATAAACAATTCTTACACAAGCTCAATATAAATTCCTCTTC 

TTTTTTTCCTTATAGCAAGATCCTTCCTTCATCATCATCTATCAAAAAGCTAACTGACTT 

GAGAGAAGCTATTTTTCCCACAAATTTGGTTAAAATTCCTCAGAGACTAAAGGTACGAAT 

TAATTTGACGCTGCAAAAGCTATTAAAGAGACATAAGCGTTTGAATTACGTTTCTATTTT 

GAATAGTATTTGCCCACCATTGGAAGGGACCGTATTGGACTTGTCGCATTTGAGTAGGCA 

ATCACCAAAGGAACGAGTCTTGAAATTTATCATTGTTATTTTACAGAAGTTATTACCCCA 

AGAAATGTTTGGCTCAAAGAAAAATAAAGGAAAAATTATCAAGAATCTAAATCTTTTATT 

AAGTTTACCCTTAAATGGCTATTTACCATTTGATAGTTTGTTGAAAAAGTTAAGATTAAA 

GGATTTTCGGTGGTTGTTCATTTCTGATATTTGGTTCACCAAGCACAATTTTGAAAACTT 

GAATCAATTGGCGATTTGTTTCATTTCCTGGCTATTTAGACAACTAATTCCCAAAATTAT 

ACAGACTTTTTTTTACTGCACCGAAATATCTTCTACAGTGACAATTGTTTACTTTAGACA 

TGATACTTGGAATAAACTTATCACCCCTTTTATCGTAGAATATTTTAAGACGTACTTAGT 

CGAAAACAACGTATGTAGAAACCATAATAGTTACACGTTGTCCAATTTCAATCATAGCAA 

AATGAGGATTATACCAAAAAAAAGTAATAATGAGTTCAGGATTATTGCCATCCCATGCAG 

AGGGGCAGACGAAGAAGAATTCACAATTTATAAGGAGAATCACAAAAATGCTATCCAGCC 

CACTCAAAAAATTTTAGAATACCTAAGAAACAAAAGGCCGACTAGTTTTACTAAAATATA 

TTCTCCAACGCAAATAGCTGACCGTATCAAAGAATTTAAGCAGAGACTTTTAAAGAAATT 

TAATAATGTCTTACCAGAGCTTTATTTCATGAAATTTGATGTCAAATCTTGCTATGATTC 

CATACCAAGGATGGAATGTATGAGGATACTCAAGGATGCGCTAAAAAATGAAAATGGGTT 

TTTCGTTAGATCTCAATATTTCTTCAATACCAATACAGGTGTATTGAAGTTATTTAATGT 

TGTTAACGCTAGCAGAGTACCAAAACCTTATGAGCTATACATAGATAATGTGAGGACGGT 

TCATTTATCAAATCAGGATGTTATAAACGTTGTAGAGATGGAAATATTTAAAACAGCTTT 

GTGGGTTGAAGATAAGTGCTACATTAGAGAAGATGGTCTTTTTCAGGGCTCTAGTTTATC 

TGCTCCGATCGTTGATTTGGTGTATGACGATCTTCTGGAGTTTTATAGCGAGTTTAAAGC 

CAGTCCTAGCCAGGACACATTAATTTTAAAACTGGCTGACGATTTCCTTATAATATCAAC 

AGACCAACAGCAAGTGATCAATATCAAAAAGCTTGCCATGGGCGGATTTCAAAAATATAA 

TGCGAAAGCCAATAGAGACAAAATTTTAGCCGTAAGCTCCCAATCAGATGATGATACGGT 

TATTCAATTTTGTGCAATGCACATATTTGTTAAAGAATTGGAAGTTTGGAAACATTCAAG 

CACAATGAATAATTTCCATATCCGTTCGAAATCTAGTAAAGGGATATTTCGAAGTTTAAT 

AGCGCTGTTTAACACTAGAATCTCTTATAAAACAATTGACACAAATTTAAATTCAACAAA 

CACCGTTCTCATGCAAATTGATCATGTTGTAAAGAACATTTCGGAATGTTATAAATCTGC 

TTTTAAGGATCTATCAATTAATGTTACGCAAAATATGCAATTTCATTCGTTCTTACAACG 

CATCATTGAAATGACAGTCAGCGGTTGTCCAATTACGAAATGTGATCCTTTAATCGAGTA 

TGAGGTACGATTCACCATATTGAATGGATTTTTGGAAAGCCTATCTTCAAACACATCAAA 

ATTTAAAGATAATATCATTCTTTTGAGAAAGGAAATTCAACACTTGCAAGC 

FIG. 48 
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AKFLHWLMSVYVVELLRSFFYVTETTFQKNRLFFYRKSVWSKLQSIGIRQHLKR 
VQLRDVSEAEVRQHREARPALLTSRLRFIPKPDGLRPIWMDYVVGARTFRREK 
RAERLTSRVKALFSVLNYERA 

FIG. 49 



GCCAAGTTCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTC 
TTTCTTTTATGTCACGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACC 
GGAAGAGTGTCTGGAGCAAGTTGCAAAGCATTGGAATCAGACAGCACTTGAAG 
AGGGTGCAGCTGCGGGACGTGTCGGAAGCAGAGGTCAGGCAGCATCGGGAAGC 
CAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCATCCCCAAGCCTGACGGGC 
TGCGGCCGATTGTGAACATGGAGTACGTCGTGGGAGCCAGAACGTTCCGCAGA 
GAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCGTGCT 
CAACTACGAGCGGGCGCG 
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MTEHHTPKSRILRFLENQYVYLCTLNDYVQLVLRGSPASSYSNICERLRSDVQTSFSIFLHSTVVGF 
DSKPDEGVQFSSPKCSQSELIANVVKQMFDESFERRRNLLMKGFSMNHEDFRAMHVNGVQNDLVSTF 
PNYLISILESKNWQLLLEIIGSDAMHYLLSKGSIFEALPNDNYLQISGIPLFKNNVFEETVSKKRKR 
T I ETS I TQNKSARKEVS WNS ISISRFSI FYRS S YKKFKQDLYFNLHS I CDRNT VHMWLQW I FPRQFG 
LINAFQVKQLHKVIPLVSQSTVVPKRLLKVYPLIEQTAKRLHRISLSKVYNHYCPYIDTHDDEKILS 
YSLKPNQVFAFLRSILVRVFPKLIWGNQRIFEI ILKDLETFLKLjSRYESFSLHYLMSNIKI SEIEWL 
VLGKRSNAKMCLSDFEKRKQIFAEFI YWLYNSFI IPILQSFFYITESSDLRNRTVYFRKDIWKLLCR 
PFITSMKMEAFEKINENNVRMDTQKTTLPPAVIRLLPKKNTFRLITNLRKRFLIKMGSNKKMLVSTN 
QTLRPVASILKHLINEESSGIPFNLEVYMKLLTFKKDLLKHRMFGRKKYFVRIDIKSCYDRIKQDLM 
FRIVKKKLKDPEFVIRKYATIHATSDRATKNFVSEAFSYFDMVPFEKVVQLLSMKTSDTLFVDFVDY 
WTKSSSEIFKMLKEHLSGHIVKIGNSQYLQKVGIPQGSILSSFLCHFYMEDLIDEYLSFTKKKGSVL 
LRWDDFLFITVNKKDAKKFLNLSLRGFEKHNFSTSLEKTVINFENSNGI INNTFFNESKKRMPFFG 
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S-l: FFY VTE TTF QKN RLF FYR KSV WSK 

S-2: RQH LKR VQL RDV SEA EVR QHR EA 

S-3: ART FRR EKR AER LTS RVK ALF SVL NYE 
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A- 2 : LFF YRK 
A- 3 : PAL LTS 
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Wild Type Telomerase Gene 
Wild Type Telomerase Gene 



1 . Transform with linear fragment 
containing the telomerase gene 
disrupting with a LEU2 or ura4 marker 



- pvT 



Telomerase Gene::M 



Wild Type Telomerase Gene 




2. Assay in selective media 



3. Sporulate, and grow on 
selective media 



Wild Type Telomerase Gene 



M 



Telomerase Gene::M 



Wild Type (X2) 



Senescence? (X2) 



(These cells will show a senescence phenotype 

if the disrupted gene encodes a telomerase subunit.) 



FIG. 65 
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1 

met ser val tyr val val glu leu leu 
GCCAAGTTCCTGCACTGGCTG ATG AGT GTG TAC GTC GTC GAG CTG CTC 

10 20 
arg ser phe phe tyr val thr glu thr thr phe gin lys asn arg 
AGG TCT TTC TTT TAT GTC ACG GAG ACC ACG TTT CAA AAG AAC AGG 

30 

leu phe phe tyr arg lys ser val trp ser lys leu gin ser ile 
CTC TTT TTC TAC CGG AAG AGT GTC TGG AGC AAG TTG CAA AGC ATT 

40 50 
gly ile arg gin his leu lys arg val gin leu arg glu leu ser 
GGA ATC AGA CAG CAC TTG AAG AGG GTG CAG CTG CGG GAG CTG TCG 

60 

glu ala glu val arg gin his arg glu ala arg pro ala leu leu 
GAA GCA GAG GTC AGG CAG CAT CGG GAA GCC AGG CCC GCC CTG CTG 

70 80 
thr ser arg leu arg phe ile pro lys pro asp gly leu arg pro 
ACG TCC AGA CTC CGC TTC ATC CCC AAG CCT GAC GGG CTG CGG CCG 

90 

ile val asn met asp tyr val val gly ala arg thr phe arg arg 
ATT GTG AAC ATG GAC TAC GTC GTG GGA GCC AGA ACG TTC CGC AGA 

100 no 

glu lys ala glu arg leu thr ser arg val lys ala leu phe 

GAA AAG ARG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG TTC 

120 

ser val leu asn tyr glu arg ala arg arg pro gly leu leu gly 
AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG GGC 

130 140 

ala ser val leu gly leu asp asp ile his arg ala trp arg thr 
GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC ACC 

150 

phe val leu arg val arg ala gin asp pro pro pro glu leu tyr 
TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG TAC 

160 170 

phe val lys val asp val thr gly ala tyr asp thr ile pro gin 
TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC CAG 

180 

asp arg leu thr glu val ile ala ser ile ile lys pro gin asn 
GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG AAC 

190 200 

thr tyr cys val arg arg tyr ala val val gin lys ala ala met 
ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC ATG 



FIG. 68 




gly thr ser 
GGC ACG TCC 

220 

gin gly ile 
CAG GGG ATC 



leu cys tyr 
CTG TGC TAC 

250 

arg asp gly 
CGG GAC GGG 



thr pro his 
ACA CCT CAC 

280 

arg gly val 
CGA GGT GTC 



val val asn 
GTG GTG AAC 

310 

phe val gin 
TTT GTT CAG 



leu leu asp 
CTG CTG GAT 

340 

tyr ala arg 
TAT GCC CGG 



phe lys ala 
TTC AAG GCT 

370 

arg leu lys 
CGG CTG AAG 



leu gin thr 
CTC CAG ACG 

400 

ala tyr arg 
GCG TAC AGG 



210 

ala arg pro 
GCA AGG CCT 



pro gin gly 
CCG CAG GGC 

240 

gly asp met 
GGC GAC ATG 



leu leu leu 
CTG CTC CTG 

270 

leu thr his 
CTC ACC CAC 



pro glu tyr 
CCT GAG TAT 

300 

phe pro val 
TTC CCT GTA 



met pro ala 
ATG CCG GCC 

330 

thr arg thr 
ACC CGG ACC 



thr ser ile 
ACC TCC ATC 

360 

gly arg asn 
GGG AGG AAC 



cys his ser 
TGT CAC AGC 

390 

val cys thr 
GTG TGC ACC 



phe his ala 
TTT CAC GCA 



ser arg ala 
TCA AGA GCC 



ser ile leu 
TCC ATC CTC 



glu asn lys 
GAG AAC AAG 



arg leu val 
CGT TTG GTG 



ala lys thr 
GCG AAA ACC 



gly cys val 
GGC TGC GTG 



glu asp glu 
GAA GAC GAG 



his gly leu 
CAC GGC CTA 



leu glu val 
CTG GAG GTG 



arg ala ser 
AGA GCC AGT 



met arg arg 
ATG CGT CGC 



leu phe leu 
CTG TTT CTG 



asn ile tyr 
AAC ATC TAC 



cys val leu 
TGT GTG CTG 



thr ser tyr 
ACG TCC TAC 

230 

ser thr leu 
TCC ACG CTG 



leu phe ala 
CTG TTT GCG 

260 

asp asp phe 
GAT GAT TTC 



phe leu arg 
TTC CTC AGG 

290 

val asn leu 
GTG AAC TTG 



ala leu gly 
GCC CTG GGT 

320 

phe pro trp 
TTC CCC TGG 



gin ser asp 
CAG AGC GAC 

350 
leu thr phe 
CTC ACC TTC 



lys leu phe 
AAA CTC TTT 

380 

asp leu gin 
GAT TTG CAG 



lys ile leu 
AAG ATC CTC 

410 

gin leu pro 
CAG CTC CCA 



val gin cys 
GTC CAG TGC 



leu cys ser 
CTC TGC AGC 



gly ile arg 
GGG ATT CGG 



leu leu val 
TTG TTG GTG 



thr leu val 
ACC CTG GTC 



arg lys thr 
CGG AAG ACA 



gly thr ala 
GGC ACG GCT 



cys gly leu 
TGC GGC CTG 



tyr ser ser 
TAC TCC AGC 



asn arg gly 
AAC CGC GGC 



gly val leu 
GGG GTC TTG 



val asn ser 
GTG AAC AGC 



leu leu gin 
CTG CTG CAG 



phe his gin 
TTT CAT CAG 
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420 

gin val trp lys asn pro his phe ser cys ala ser ser leu thr 
CAA GTT TGG AAG AAC CCA CAT TTT TCC TGC GCG TCA TCT CTG ACA 



arg leu pro leu leu leu his pro glu ser gin glu arg arg asp 
CGG CTC CCT CTG CTA CTC CAT CCT GAA AGC CAA GAA CGC AGG GAT 

450 

val ala gly gly gin gly arg arg arg pro ser ala leu arg gly 
GTC GCT GGG GGC CAA GGG CGC CGC CGG CCC TCT GCC CTC CGA GGC 

460 470 

arg ala val ala val pro pro ser ile pro ala gin ala asp ser 
CGT GCA GTG GCT GTG CCA CCA AGC ATT CCT GCT CAA GCT GAC TCG 

480 

thr pro cys his leu arg ala thr pro gly val thr gin asp ser 
ACA CCG TGT CAC CTA CGT GCC ACT CCT GGG GTC ACT CAG GAC AGC 

490 500 

pro asp ala ala glu ser glu ala pro gly asp asp ala asp cys 
CCA GAC GCA GCT GAG TCG GAA GCT CCC GGG GAC GAC GCT GAC TGC 

510 

pro gly gly arg ser gin pro gly thr ala leu arg leu gin asp 
CCT GGA GGC CGC AGC CAA CCC GGC ACT GCC CTC AGA CTT CAA GAC 

520 530 

his pro gly leu met ala thr arg pro gin pro gly arg glu gin 
CAT CCT GGA CTG ATG GCC ACC CGC CCA CAG CCA GGC CGA GAG CAG 

540 

thr pro ala ala leu ser arg arg ala tyr thr ser gin gly gly 
ACA CCA GCA GCC CTG TCA CGC CGG GCT TAT ACG TCC CAG GGA GGG 

550 560 

arg gly gly pro his pro gly leu his arg trp glu ser glu ala 
AGG GGC GGC CCA CAC CCA GGC CTG CAC CGC TGG GAG TCT GAG GCC 

564 
OP 

TGA GTGAGTGTTTGGCCGAGGCCTGCATGTCCGGCTGAAGGCTGAGTGTCCGGCTGAGGC 

CTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTCCAGCACACCTGCGTTTTCACTTCCCCAC 

AGGCTGGCGTTCGGTCCACCCCAGGGCCAGCTTTTCCTCACCAGGAGCCCGGCTTCCACT 

CCCCACATAGGAATAGTCCATCCCCAGATTCGCCATTGTTCACCCTTCGCCCTGCCTTCC 

TTTGCCTTCCACCCCCACCATTCAGGTGGAGACCCTGAGAAGGACCCTGGGAGCTTTGGG 

AATTTGGAGTGACCAAAGGTGTGCCCTGTACACAGGCGAGGACCCTGCACCTGGATGGGG 

GTCCCTGTGGGTCAAATTGGGGGGAGGTGCTGTGGGAGTAAAATACTGAATATATGAGTT 

TTTCAGTTTTGGAAAAAAAAAAAAAAAAAAAAAAAAAA 



430 



440 
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Motif -1 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



. LVVSLIRCFFYVTEQQKSYSKT . 
. FIIPILQSFFYITESSDLRNRT. 
. LIPKIIQTFFYCTEISSTVTIV. 
. YVVELLRS FFYVTETTFQKNRL . 
FFY TE 



Motif 0 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



K 

p hhh K hR h R 

. KSLGFAPGKLRLIPKKT- -TFRPIMTFNKKIV 
. QKTTLPPAVIRLLPKKN- - TFRLITNLRKRFL 
. TLSNFNHSKMRI IPKKSNNEFRI IAI PCRGAD 
. ARPALLTSRLRFIPKPD- -GLRPIVNMDYVVG , 
R PK R I 



Motif A 
Ep pi 2 3 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



AF 

h hDh GY h 
. PKLFFATMDIEKCYDSVNREKLSTFLK . 
. RKKYFVRIDIKSCYDRIKQDLMFRIVK . 
, PELYFMKFDVKSCYDSIPRMECMR1LK. 
PELYFVKVDVTGAYDTIPQDRLTEV1A . 

F D YD 



.//, 



Motif B 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



hPQG pS hh 
. NGKFYKQTKGIPQGLCVSSILSSFYYA . 
. GNSQYLQKVGIPQGSILSSFLCHFYME . 
. EDKCYIREDGLFQGSSLSAPIVDLVYD . 
. RATSYVQCQG1PQGSILSTLLCSLCYG . 
G QG S 



Motif C 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



h F DD hhh 
, PNVNLLMRLTDDYLLITTQENN . . 
, KKGSVLLRWDDFLFITVNKKD. . 
, SQDTLILKLADDFLI ISTDQQQ . . 
. RRDGLLLRLVDDFLLVTPHLTH . . 
DD L 



Motif D 
Ep pl23 
Sp Tezl 
Sc Est2 
Hs TCP1 
consensus 



Gh h cK 
. NVSRE3XTGFKFNMKKL . 
. LNLSLRGFEKHNFST . 
. KKLAMGGFQKYNAKA . 
. LRTLVRGVPEYGCW . 

G 
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1 GCAGCGCTGC GTCCTGCTGC 

51 CCGCGATGCC GCGCGCTCCC 

101 AGCCACTACC GCGAGGTGCT 

151 GCCCCAGGGC TGGCGGCTGG 

2 01 CGNTGGTGGC CCANTGCNTG 
251 CCCGCCGCCC CCTCCTTCCG 

3 01 CCGAGTGCTG CANANGCTGT 
351 TCGGCTTCGC GCTGCTGGAC 

4 01 ACCACCAGCG TGCGCAGCTA 
451 GGGGAG CGGG GCGTGGGGGC 
501 TGGTTCACCT GCTGGCACGC 
551 TGCGCCTACC ANGTGTGCGG 
601 TCAGGCCCGG CCCCCGCCAC 
651 CAACGGGCCT GGAAC CATAG 
701 CCAGCCCCGG GTGCGAGGAG 
751 GTTGCCCAAG AGGCCCAGGC 
8 01 CCGTTGGGCA GGGGTCCTGG 
851 GACCGTGGTT TCTGTGTGGT 
901 CTCTTTGGAG GGTGCGCTCT 
951 GCCGCCAGCA CCACGCGGGC 

1001 GGGACACGCC TTGTCCCCCG 

1051 TCCTCAGGCG ACAAGNACAC 

1101 AGGCCCAGCC TGACTGGCGT 

1151 TTCCAGGCCT TGGATGCCAG 

12 01 GCGNTACTGG CAAATGCGGC 

12 51 CGCAGTGCCC CTACGGGGTG 

13 01 GCGGTCACCC CAGCAGCCGG 

13 51 TGTGGCGGCC CCCGAGGAGG 

14 01 CTGCTCCGCC AG C A C AG C AG 

14 51 CTGCCTGCGC CGGCTGGTGC 

15 01 AACGCCGCTT CCTCAGGAAC 
1551 GCCAAGCTCT CGCTGCAGGA 

16 01 CGCTTGGCTG CGCAGGAGCC 

16 51 ACCGTCTGCG TGAGGAGATC 

17 01 GTGTACGTCG TCGAGCTGCT 
1751 GTTTCAAAAG AACAGGCTCT 

18 01 TGCAAAGCAT TGGAATCAGA 
1851 CTGTCGGAAG C AGAGGT C AG 
1901 GACGTCCAGA CTCCGCTTCA 
1951 TGAACATGGA CTACGTCGTG 
2 001 GCCGAGCGTC TCACCTCGAG 
2 051 C GAG CGGG CG CGGCGCCCCG 
2101 ACGATATCCA CAGGGCCTGG 
2151 GACCCGCCGC CTGAGCTGTA 

22 01 CGACACCATC CCCCAGGACA 
2251 AACCCCAGAA CACGTACTGC 

23 01 GCCCATGGGC ACGTCCGCAA 

23 51 AGACCTCCAG CCGTACATGC 

24 01 GCCCGCTGAG GGATGCCGTC 
2451 GCCAGCAGTG GCCTCTTCGA 



GCACGTGGGA AGCCCTGGCC CCGGCCACCC 
CGCTGCCGAG CCGTGCGCTC CCTGCTGCGC 
GCCGCTGGCC ACGTTCGTGC GGCGCCTGGG 
TGCAGCGCGG GGACCCGGCG GCTTTCCGCG 
GTGTGCGTGC CCTGGGANGN ANGGCNGCCC 
CCAGGTGTCC TGCCTGAANG ANCTGGTGGC 
GCGANCGCGG CGCGAANAAC GTGCTGGCCT 
GGGGCCCGCG GGGGCCCCCC CGAGGCCTTC 
CCTGCCCAAC ACGGTGACCG ACGCACTGCG 
TGCTGCTGCG CCGCGTGGGC GACGACGTGC 
TGCGCGNTNT TTGTGCTGGT GGNTCCCAGC 
GCCGCCGCTG TACCAGCTCG GCGCTGCNAC 
ACGCTANTGG ACCCGAANGC GTCTGGGATC 
CGTCAGGGAG GCCGGGGTCC CCCTGGGCTG 
GCGCGGGGGC AGTGCCAGCC GAAGTCTGCC 
GTGGCGCTGC CCCTGAGCCG GAGCGGACGC 
GCCCACCCGG GCAGGACGCC TGGACCGAGT 
GTCACCTGCC AGACCCGCCG AAGAAGCCAC 
CTGGCACGCG CCACTCCCAC CCATCCGTGG 
CCCCCATCCA CATCGCGGCC ACCACGTCCT 
GTGTACGCCG AGACCAAGCA CTTCCTCTAC 
TGCGNCCCTC CTTCCTACTC AATATATCTG 
TCGGGAGGTT CGTGGAGACA NTCTTTCTGG 
GATTGCCCGC AGGTTGCCCC GCCTGCCCCA 
CCCTGTTTCT GGAGCTGCTT GGGAACCACG 
TTCCTCAAGA CGCACTGCCC GCTGCGAGCT 
TGTCTGTGCC CGGGAGAAGC CCCAGGGCTC 
AGGAACACAG ACCCCCGTCG CCTGGTGCAG 
CCCCTGGCAG GTGTACGGCT TCGTGCGGGC 
CCCCAGGCCT CTGGGGCTGC AGGCACAACG 
AC C AAGAAGT TCATCTCCCT GGGGAAGCAT 
GCTGACGTGG AAGATGAGCG TGCGGGACTG 
CAGGGGTTGG CTGTGTTCCG GCCGCAGAGC 
CTGGCCAAGT TCCTGCACTG GCTGATGAGT 
CAGGTCTTTC TTTTATGTCA CGGAGACCAC 
TTTTCTACCG GAAGAGTGTC TGGAGCAAGT 
CAGCACTTGA AGAGGGTGCA GCTGCGGGAG 
GCAGCATCGG GAAGCCAGGC CCGCCCTGCT 
TCCCCAAGCC TGACGGGCTG CGGCCGATTG 
GGAGC CAGAA CGTTCCGCAG AGAAAAGAGG 
GGTGAAGGCA CTGTTCAGCG TGCTCAACTA 
GCCTCCTGGG CGCCTCTGTG CTGGGCCTGG 
CGCACCTTCG TGCTGCGTGT GCGGGCCCAG 
CTTTGTCAAG GTGGATGTGA CGGGCGCGTA 
GGCTCACGGA GGTCATCGCC AG CAT CAT CA 
GTGCGTCGGT ATGC CGTGGT CCAGAAGGCC 
GGCCTTCAAG AGCCACGTCT CTACCTTGAC 
GACAGTTCGT GGCTCACCTG CAGGANAACA 
GTCATCGAGC AGAGCTCCTC CCTGAATGAG 
CGTCTTCCTA CGCTTCATGT GCCACCACGC 
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2 501 CGTGCGCATC AGGGGCAAGT CCTACGTCCA GTGCCAGGGG ATCCCGCAGG 

2 5 51 GCTCCATCCT CTCCACGCTG CTCTGCAGCC TGTGCTACGG CGACATGGAG 

2 601 AACAAGCTGT TTGCGGGGAT TCGGCGGGAC GGGCTGCTCC TGCGTTTGGT 

2 651 GGATGATTTC TTGTTGGTGA CACCTCACCT CACCCACGCG AAAACCTTCC 

2 701 TCAGGACCCT GGTCCGAGGT GTCCCTGAGT ATGGCTGCGT GGTGAACTTG 

2 751 CGGAAGACAG TGGTGAACTT CCCTGTAGAA GACGAGGCGC TGGGTGGCAC 

2 801 GGCTTTTGTT CAGATGCCGG CCCACGGCCT ATTCCCCTGG TGCGGCCTGC 

2 851 TGCTGGATAC CCGGACCCTG GAGGTG C AGA GCGACTACTC CAGCTATGCC 

2 901 CGGACCTCCA T C AG AG C C AG TCTCACCTTC AACCGCGGCT TCAAGGCTGG 

2 951 GAGGAACATG CGTCGCAAAC TCTTTGGGGT CTTGCGGCTG AAGTGTCACA 

3 0 01 GCCTGTTTCT GGATTTGCAG GTGAACAGCC TCCAGACGGT GTGCACCAAC 
3 051 AT CTACAAGA TCCTCCTGCT GCAGGCGTAC AGGTTTCACG CATGTGTGCT 
3101 GCAGCTCCCA TTTCATCAGC AAGTTTGGAA GAACCCCACA TTTTTCCTGC 
3151 GCGTCATCTC TGACACGGCC TCCCTCTGCT ACTCCATCCT GAAAGCCAAG 
32 01 AACGCAGGGA TGTCGCTGGG GGCCAAGGGC GCCGCCGGCC CTCTGCCCTC 
3 251 CGAGGC CGTG CAGTGGCTGT GCCACCAAGC ATTCCTGCTC AAGCTGACTC 
3 3 01 GACACCGTGT CACCTACGTG CCACTCCTGG GGTCACTCAG GA C AG C C C AG 
3 3 51 ACGCAGCTGA GTCGGAAGCT CCCGGGGACG ACGCTGACTG CGCTGGAGGC 
34 01 CGCAGCCAAC CCGGCACTGC CCTCAGACTT CAAGACCATC CTGGACTGAT 
34 51 GGCCACCCGC CCAGAGCCAG GCCGAGAGCA GACACCAGCA GCCCTGTCAC 
3 5 01 GCCGGGCTCT ACGTCCCAGG GAGGGAGGGG CGGCCCAGAC CCAGGCCCGC 
3 551 ACCGCTGGGA GTCTGAGGCC TGAGTGAGTG TTTGGCCGAG GCCTGCATGT 
3 601 CCGGCTGAAG GCTGAGTGTC CGGCTGAGGC CTGAGCGAGT GTCCAGCCAA 
3 651 GGGCTGAGTG TCCAGCACAC CTGCCGTCTT CACTTCCCCA CAGGCTGGCG 
3 701 CTCGGCTCCA CCCCAGGGCC AGCTTTTCCT CACCAGGAGC CCGGCTTCCA 
3 751 CTCCCCACAT AGGAATAGTC CATCCCCAGA TTCGCCATTG TTCACCCCTC 
3 801 GCCCTGCCCT CCTTTGCGTT CCACCCCCAC CATCCAGGTG GAGACCCTGA 
3 851 GAAGGACCCT GGGAGCTCTG GGAATTTGGA GTGAC C AAAG GTGTGCCCTG 
3 901 TACACAGGCG AGGACCCTGC ACCTGGATGG GGGTCCCTGT GGGT CAAATT 

3 951 GGGGGGAGGT GCTGTGGGAG TAAAATACTG AATATATGAG TTTTTCAGTT 

4 001 TTGAAAAAAA AAAAAAAAAA AAAAAAAAA 
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GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCGATGCC 

1 + + + + + + 60 

CGTCGCGACGCAGGACGACGCGTGCACCCTTCGGGACCGGGGCCGGTGGGGGCGCTACGG 

AALRPAAHVGS PGPGHPRDA 
QRCVLLRTWEALAPATPAMP- 
SAASCCARGKPWPRPPPRCR- 

GCGCGCTCCCCGCTGCCGAGCCGTGCGCTCCCTGCTGCGCAGCCACTACCGCGAGGTGCT 
61 + + + + + + 12Q 

CGCGCGAGGGGCGACGGCTCGGCACGCGAGGGACGACGCGTCGGTGATGGCGCTCCACGA 

ARS PLPSRALPAAQ PLPRGA 
RAPRCRAVRSLLRSHYREVL 
ALPAAEPCAPCCAATTARCC- 

GCCGCTGGCCACGTTCGTGCGGCGCCTGGGGCCCCAGGGCTGGCGGCTGGTGCAGCGCGG 

121 + + + + + + 180 

CGGCGACCGGTGCAAGCACGCCGCGGACCCCGGGGTCCCGACCGCCGACCACGTCGCGCC 

AAGHVRAAPGAPGL.AAGAAR 
PLATFVRRLGPQGWRLVQRG- 
RWPRS CGAWGPRAGGWC SAG- 

GGACCCGGCGGCTTTCCGCGCGNTGGTGGCCCANTGCNTGGTGTGCGTGCCCTGGGANGN 

181 + + + + + + 240 

CCTGGGCCGCCGAAAGGCGCGCNACCACCGGGTNACGNACCACACGCACGGGACCCTNCN 

GPGGFPR?GGP ? PGVRALG? 
DPAAFR. A?VA?C?VCVPW? ? 
TRRLSARWWP?AWCACPG? ? - 

ANGGCNGCCCCCCGCCGCCCCCTCCTTCCGCCAGGTGTCCTGCCTGAANGANCTGGTGGC 

241 + + + + + + 300 

TNCCGNCGGGGGGCGGCGGGGGAGGAAGGCGGTCCACAGGACGGACTTNCTNGACCACCG 

?AAPRRPLLPPGVLPE??GG 
? ?PPAAPSFRQVSCL??LVA- 
G?PPPPPPSARCPA*??WWP- 

CCGAGTGCTGCANANGCTGTGCGANCGCGGCGCGAAMAACGTGCTGGCCTTCGGCTTCGC 

301 + + + + + + 360 

GGCTCACGACGTNTNCGACACGCTNGCGCCGCGCTTNTTGCACGACCGGAAGCCGAAGCG 

PSAA?AVR?RRE?RAGLRLR 
RVL? ?LC?RGA?NVLAFGFA- 
ECC? ?CA?AAR?TCWPSASR- 

GCTGCTGGACGGGGCCCGCGGGGGCCCCCCCGAGGCCTTCACCACCAGCGTGCGCAGCTA 
361 + + + + + + 420 

CGACGACCTGCCCCGGGCGCCCCCGGGGGGGCTCCGGAAGTGGTGGTCGCACGCGTCGAT 

AAGRGPRGPPRGLHHQRAQL 
LLDGARGGPPEAFTTSVRSY 
CWTGPAGAPPRPS PPACAAT- 

CCTGCCCAACACGGTGACCGACGCACTGCGGGGGAGCGGGGCGTGGGGGCTGCTGCTGCG 

421 + + + + + + 480 

GGACGGGTTGTGCCACTGGCTGCGTGACGCCCCCTCGCCCCGCACCCCCGACGACGACGC 

PAQHGDRRTAGERGVGAAAA 
LPNTVTDALRGSGAWGLLLR- 
CPTR* PTHCGGAGRGGC CCA- 
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PRGRRRAGS PAGTLR? ? C A G 
RVGDDVLVHLLARCA ? F V L V - 
AWATTCWFTCWHAAR ? LCWW- 

GGNTCCCAGCTGCGCCTACCANGTGTGCGGGCCGCCGCTGTACCAGCTCGGCGCTGCNAC 
541 + + + + + + 600 

CCNAGGGTCGACGCGGATGGTNCACACGCCCGGCGGCGACATGGTCGAGCCGCGACGNTG 

GSQLRLP ? VRAAAVPARRC? 
? PSCAY?VCGPPLYQLGAAT- 
? PAAPT ? CAGRRCTSSAL ? L - 

TCAGGCCCGGCCCCCGCCACACGCTANTGGACCCGAANGCGTCTGGGATCCAACGGGCCT 

601 + + + + + + 660 

AGTCCGGGCCGGGGGCGGTGTGCGATNACCTGGGCTTNCGCAGACCCTAGGTTGCCCGGA 

SGPAPATR?WTR?RLGSNGP 
QARPPPHA?GPE?VWDPTGL- 
RPGPRHTL ? D P ?ASGIQRAW- 

GGAACCATAGCGTCAGGGAGGCCGGGGTCCCCCTGGGCTGCCAGCCCCGGGTGCGAGGAG 
661 + + + + + + 72Q 

CCTTGGTATCGCAGTCCCTCCGGCCCCAGGGGGACCCGACGGTCGGGGCCCACGCTCCTC 

GTIASGRPGSPWAASPGCEE 
EP*RQGGRGPPGLPAPGARR- 
NHSVREAGVPLGCQ PRVRGG- 

GCGCGGGGGCAGTGCCAGCCGAAGTCTGCCGTTGCCCAAGAGGCCCAGGCGTGGCGCTGC 
721 + + + + + + 780 

CGCGGGCCCGTCACGGTCGGCTTCAGACGGCAACGGGTTCTCCGGGTCCGCACCGCGACG 

ARGQCQ PKSAVAQEAQAWRC 
RGGSASRSLPLPKRPRRGAA- 
AGAVPAEVCRC PRG PGVAL P- 

CCCTGAGCCGGAGCGGACGCCCGTTGGGCAGGGGTCCTGGGCCCACCCGGGCAGGACGCC 

781 + + + + + + 840 

GGGACTCGGCCTCGCCTGCGGGCAACCCGTCCCCAGGACCCGGGTGGGCCCGTCCTGCGG 

P *AGADARWAGVLGPPGQDA 
PEPERTPVGQGSWAHPGRTP- 
LSRSGRPLGRGPGPTRAGRL- 

TGGACCGAGTGACCGTGGTTTCTGTGTGGTGTCACCTGCCAGACCCGCCGAAGAAGCCAC 

841 + + + + + + 900 

ACCTGGCTCACTGGCACCAAAGACACACCACAGTGGACGGTCTGGGCGGCTTCTTCGGTG 

WTE* PWFLGGVTCQTRRRSH 
GPSDRGFCVVSPARPAEEAT- 
DRVTVVS VWCHLPDPPKKPP- 

CTCTTTGGAGGGTGCGCTCTCTGGCACGCGCCACTCCCACCCATCCGTGGGCCGCCAGCA 

901 + + + + + + 960 

GAGAAACCTCCCACGCGAGAGACCGTGCGCGGTGAGGGTGGGTAGGCAGCCGGCGGTCGT 

LFGGCALWHAPLPPIRGPPA 
SLEGALSGTRHSHPSVGRQH- 
LWRVRS LARAT PTH PWAAS T - 

CCACGCGGGCCCCCCATCCACATCGCGGCCACCACGTCCTGGGACAGGCCTTGTCCCCCG 

961 + + + + + + 1020 

GGTGCGCCCGGGGGGTAGGTGTAGCGCCGGTGGTGCAGGACCCTGTGCGGAACAGGGGGC 
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PRGPPIH1AATTSWDTPCPP 
HAGPPSTSRPPRPGTRLVPR- 
TRAPHPHRGHHVLGHALS P G - 

GTGTACGCCGAGACCAAGCACTTCCTCTACTCCTCAGGCGACAAGNACACTGCGNCCCTC 

1021 + + + + + + 1080 

CACATGCGGCTCTGGTTCGTGAAGGAGATGAGGAGTCCGCTGTTCNTGTGACGCNGGGAG 

VYAETKHFLYSSGDK?TA?L 
CTPRPSTSSTPQAT7TLRPS - 
VRRDQALPLLLRRQ9HC ? P P - 

CTTCCTACTCAATATATCTGAGGCCCAGCCTGACTGGCGTTCGGGAGGTTCGTGGAGACA 

1081 + + + + + + H40 

GAAGGATGAGTTATATAGACTCCGGGTCGGACTGACCGCAAGCCCTCCAAGCACCTCTGT 

LPTQYI *GPA*LAFGRFVET 
FLLNI SEAQPDWRSGGSWR? - 
SYS IYLRPSLTGVREVRGD ? - 

NTCTTTCTGGTTCCAGGCCTTGGATGCCAGGATTCCCCGCAGGTTGCCCCGCCTGCCCCA 

1141 + + + + + + 1200 

NAGAAAGACCAAGGTCCGGAACCTACGGTCCTAAGGGGCGTCCAACGGGGCGGACGGGGT 

PFLVPGLGCQDSPQVAPPAP 
SFWFQALDARI PRRLPRLPQ 
LSGSRPWMPGFPAGCPACPS- 

GCGNTACTGGCAAATGCGGCCCCTGTTTCTGGAGCTGCTTGGGAACCACGCGCAGTGCCC 

1201 + + + + + + 1260 

CGCNATGACCGTTTACGCCGGGGACAAAGACCTCGACGAACCCTTGGTGCGCGTCACGGG 

A? LANAAPVSGAAWE PRAVP 
RYWQMRPLFLELLGNHAQCP- 
? TGKCGPCFWSCLGTTRSAP- 

CTACGGGGTGTTCCTCAAGACGCACTGCCCGCTGCGAGCTGCGGTCACCCCAGCAGCCGG 

1261 + + + + + + 1320 

GATGCCCCACAAGGAGTTCTGCGTGACGGGCGACGCTCGACGCCAGTGGGGTCGTCGGCC 

LRGVPQDALPAASCGHPSSR 
YGVFLKTHC PLRAAVTPAAG 
TGCSSRRTARCELRS PQQPV- 

TGTCTGTGCCCGGGAGAAGCCCCAGGGCTCTGTGGCGGCCCCCGAGGAGGAGGAACACAG 

1321 + + + + + + 1380 

ACAGAGACGGGCCCTCTTCGGGGTCCCGAGACACCGCCGGGGGCTCCTCCTCCTTGTGTC 

CLCPGEAPGLCGGPRGGGTQ 
VCAREKPQGSVAAPEEEEHR- 
SVPGRS PRALWRP PRRRNTD- 

ACCCCCGTCGCCTGGTGCAGCTGCTCCGCCAGCACAGCAGCCCCTGGCAGGTGTACGGCT 
1381 + + + + + + 1440 

TGGGGGCAGCGGACCACGTCGACGAGGCGGTCGTGTCGTCGGGGACCGTCCACATGCCGA 

TPVAWCSCSASTAAPGRCTA 
P PS PGAAAP PAQQPLAGVRL 
PRRLVQLLRQHSS PWQVYGF- 

TCGTGCGGGCCTGCCTGCGCCGGCTGGTGCCCCCAGGCCTCTGGGGCTCCAGGCACAACG 

1441 + + + + + + 1500 

AGCACGCCCGGACGGACGCGGCCGACCACGGGGGTCCGGAGACCCCGAGGTCCGTGTTGC 
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SCGPACAGWCPQASGAPGTT 
RAGLPAPAGAPRPLGLQAQR 
VRACLRRLVPPGLWGSRHNE 



AACGCCGCTTCCTCAGGAACACCAAGAAGTTCATCTCCCTGGGGAAGCATGCCAAGCTCT 
TTGCGGCGAAGGAGTCCTTGTGGTTCTTCAAGTAGAGGGACCCCTTCGTACGGTTCGAGA 



NAASSGTPRSSSPWGSMPSS 
TPLPQEHQEVHLPGEACQAL 
RRFLRNTKKFI SLGKHAKLS 



CGCTGCAGGAGCTGACGTGGAAGATGAGCGTGCGGGACTGCGCTTGGC TGCGCAGGAGC C 
GCGACGTCCTCGACTGCACCTTCTACTCGCACGCCCTGACGCGAACCGACGCGTCCTCGG 



RCRS*RGR*ACGTALGCAGA 
AAGADVEDERAGLRLAAQEP - 
LQELTWKMSVRDCAWLRRS P - 



CAGGGGTTGGCTGTGTTCCGGCCGCAGAGCACCGTCTGCGTGAGGAGATCCTGGCCAAGT 
+ + + + + + 

GTCCCCAACCGACACAAGGCCGGCGTCTCGTGGCAGACGCACTCCTCTAGGACCGGTTCA 



QGLAVFRPQSTVCVRRSWPS 
RGWLCSGRRAPSA* GDPGQV- 
GVGCVPAAEHRLREE I LAKF- 

TCCTGCACTGGCTGATGAGTGTGTACGTCGTCGAGCTGCTCAGGTCTTTCTTTTATGTCA 
1681 + + + + + + 1740 

AGGACGTGACCGACTACTCACACATGCAGCAGCTCGACGAGTCCAGAAAGAAAATACAGT 

SCTG**VCTSSSCSGLSFMS 
PALADE CVRRRAAQVFLLCH 
LHWLMSVYVVELLRS FFYVT- 



CGGAGACCACGTTTCAAAAGAACAGGCTCTTTTTCTACCGGAAGAGTGTCTGGAGCAAGT 
+ + + + + + 

GCCTCTGGTGCAAAGTTTTCTTGTCCGAGAAAAAGATGGCCTTCTCACAGACCTCGTTCA 



RRPRFKRTGSFSTGRVSGAS 
GDHVSKEQALFLPEECLEQV 
ETTFQKNRLFFYRKSVWSKL- 

TGCAAAGCATTGGAATCAGACAGCACTTGAAGAGGGTGCAGCTGCGGGAGCTGTCGGAAG 
1801 + + + + + + lg60 

ACGTTTCGTAACCTTAGTCTGTCGTGAACTTCTCCCACGTCGACGCCCTCGACAGCCTTC 

CKALESDST*RGCSCGSCRK 
AKHWNQTALEEGAAAGAVGS - 
QS IGIRQHLKRVQLRELSEA- 

CAGAGGTCAGGCAGCATCGGGAAGCCAGGCCCGCCCTGCTGACGTCCAGACTCCGCTTCA 

1861 + + + + + + 1920 

GTCTCCAGTCCGTCGTAGCCCTTCGGTCCGGGCGGGACGACTGCAGGTCTGAGGCGAAGT 

QRSGSIGKPGPPC*RPDSAS 
RGQAASGSQARPADVQTPLH - 
EVRQHREARPALLTSRLRFI- 

TCCCCAAGCCTGACGGGCTGCGGCCGATTGTGAACATGGACTACGTCGTGGGAGCCAGAA 
1921 + + + + + + 1980 

AGGGGTTCGGACTGCCCGACGCCGGCTAACACTTGTACCTGATGCAGCACCCTCGGTCTT 
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SPSLTGCGRL*TWTTSWEPE 
P Q A * RAAADCEHGLRRGSQN- 
PKPDGLRP IVNMDYVVGART- 

CGTTCCGCAGAGAAAAGAGGGCCGAGCGTCTCACCTCGAGGGTGAAGGCACTGTTCAGCG 
1981 + + + + + + 204Q 

GCAAGGCGTCTCTTTTCTCCCGGCTCGCAGAGTGGAGCTCCCACTTCCGTGACAAGTCGC 

RSAEKRGPSVSPRG*RHCSA 
VPQRKEGRASHLEGEGTVQR- 
FRRE KRAERLT S RVKAL F S V ~ 

TGCTCAACTACGAGCGGGCGCGGCGCCCCGGCCTCCTGGGCGCCTCTGTGCTGGGCCTGG 

2041 + + + + + + 2100 

ACGAGTTGATGCTCGCCCGCGCCGCGGGGCCGGAGGACCCGCGGAGACACGACCCGGACC 

CSTTSGRGAPASWAPLCWAW 
AQLRAGAAPRPPGRLCAGPG- 
LNYERARRPGLLGASVLGLD- 

ACGATATCCACAGGGCCTGGCGCACCTTCGTGCTGCGTGTGCGGGCCCAGGACCCGCCGC 

2101 + + + + + + 2160 

TGCTATAGGTGTCCCGGACCGCGTGGAAGCACGACGCACACGCCCGGGTCCTGGGCGGCG 

TISTGPGAPSCCVCGPRTRR 
RYPQGLAHLRAACAGPGPAA- 
D IHRAWRTFVLRVRAQD P P P - 

CTGAGCTGTACTTTGTCAAGGTGGATGTGACGGGCGCGTACGACACCATCCCCCAGGACA 

2161 + + + + + + 2220 

GACTCGACATGAAACAGTTCCACCTACACTGCCCGCGCATGCTGTGGTAGGGGGTCCTGT 

LSCTLSRWM*RARTTPSPRT 
*AVLCQGGCDGRVRHHPPGQ 
ELYFVKVDVTGAYDT I P QDR- 

GGCTCACGGAGGTCATCGCCAGCATCATCAAACCCCAGAACACGTACTGCGTGCGTCGGT 

2221 + + + + + + 2280 

CCGAGTGCCTCCAGTAGCGGTCGTAGTAGTTTGGGGTCTTGTGCATGACGCACGCAGCCA 

GSRRSS PASSNPRTRTACVG 
AHGGHRQHHQTPEHVLRASV- 
LTEVIAS I I KPQNTYCVRRY- 

ATGCCGTGGTCCAGAAGGCCGCCCATGGGCACGTCCGCAAGGCCTTCAAGAGCCACGTCT 

2281 + + + + + + 2340 

TACGGCACCAGGTGTTCCGGCGGGTACCCGTGCAGGCGTTCCGGAAGTTCTCGGTGCAGA 

MPWSRRPPMGTSARPSRATS 
CRGPEGRPWARPQGLQEPRL 
AVVQKAAHGHVRKAFKSHVS- 

CTACCTTGACAGACCTCCAGCCGTACATGCGACAGTTCGTGGCTCACCTGCAGGANAACA 

2341 + + + + + + 2400 

GATGGAACTGTCTGGAGGTCGGCATGTACGCTGTCAAGCACCGAGTGGACGTCCTNTTGT 

LP*QTSSRTCDSSWLTCR?T 
YLDRPPAVHATVRGS PAG? Q 
TLTDLQPYMRQFVAHLQ ?NS- 

GCCCGCTGAGGGATGCCGTCGTCATCGAGCAGAGCTCCTCCCTGAATGAGGCCAGCAGTG 

2401 + + + + + + 2460 

CGGGCGACTCCCTACGGCAGCAGTAGCTCGTCTCGAGGAGGGACTTACTCCGGTCGTCAC 
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AR*GMPSSSSRAPP*MRPAV 
PAEGCRRHRAELLPE * G Q Q W - 
PLRDAVVIEQSSSLNEASSG- 

GCCTCTTCGACGTCTTCCTACGCTTCATGTGCCACCACGCCGTGCGCATCAGGGGCAAGT 

2461 + + + + + + 2520 

CGGAGAAGCTGCAGAAGGATGCGAAGTACACGGTGGTGCGGCACGCGTAGTCCCCGTTCA 

ASSTSSYASCATTPCASGAS 
PLRRLPTLHVPPRRAHQGQV- 
LFDVFLRFMCHHAVR IRGKS- 

CCTACGTCCAGTGCCAGGGGATCCCGCAGGGCTCCATCCTCTCCACGCTGCTCTGCAGCC 

2521 + + + + + + 2580 

GGATGCAGGTCACGGTCCCCTAGGGCGTCCCGAGGTAGGAGAGGTGCGACGAGACGTCGG 

PTSSARGSRRAPSSPRCSAA 
LRPVPGDPAGLHPLHAALQP- 
YVQCQGI PQGS I LSTLLCSL- 

TGTGCTACGGCGACATGGAGAACAAGCTGTTTGCGGGGATTCGGCGGGACGGGCTGCTCC 

2581 + + + + + + 2640 

ACACGATGCCGCTGTACCTCTTGTTCGACAAACGCCCCTAAGCCGCCCTGCCCGACGAGG 

CATATWRTSCLRGFGGTGCS 
VLRRHGEQAVCGDSAGRAAP- 
CYGDMENKLFAG I RRDGLLL- 

TGCGTTTGGTGGATGATTTCTTGTTGGTGACACCTCACCTCACCCACGCGAAAACCTTCC 

2641 + + + + + + 2700 

ACGCAAACCACCTACTAAAGAACAACCACTGTGGAGTGGAGTGGGTGCGCTTTTGGAAGG 

CVWWMISCW*HLTSPTRKPS 
A F G G * FLVGDTSPHPRENLP 
RLVDDFLLVT PHLTHAKTFL- 

TCAGGACCCTGGTCCGAGGTGTCCCTGAGTATGGCTGCGTGGTGAACTTGCGGAAGACAG 

2701 + + + + + + 2760 

AGTCCTGGGACCAGGCTCCACAGGGACTCATACCGACGCACCACTTGAACGCCTTCTGTC 

SGPWSEVSLSMAAW*TCGRQ 
QDPGPRCP*VWLRGELAEDS- 
RTLVRGVPEYGCVVNLRKTV- 

TGGTGAACTTCCCTGTAGAAGACGAGGCCCTGGGTGGCACGGCTTTTGTTCAGATGCCGG 

2761 + + + + + + 2820 

ACCACTTGAAGGGACATCTTCTGCTCCGGGACCCACCGTGCCGAAAACAAGTCTACGGCC 

W*TSL*KTRPWVARLLFRCR 
GELPCRRRGPGWHGFCSDAG- 
VNFPVEDEALGGTAFVQMPA- 

CCCACGGCCTATTCCCCTGGTGCGGCCTGCTGCTGGATACCCGGACCCTGGAGGTGCAGA 

2821 + + + + + + 2880 

GGGTGCCGGATAAGGGGACCACGCCGGACGACGACCTATGGGCCTGGGACCTCCACGTCT 

PTAYSPGAACCWIPGPWRCR 
PRP I PLVRPAAGYPDPGGAE 
HGLFPWCGLLLDTRTLEVQS- 

GCGACTACTCGAGCTATGCCCGGACCTCCATCAGAGCCAGTCTCACCTTCAACCGCGGCT 

2881 + + + + + + 2940 

CGCTGATGAGGTCGATACGGGCCTGGAGGTAGTCTCGGTCAGAGTGGAAGTTGGCGCCGA 
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ATTPAMPGPPSEPVSPSTAA 
RLLQLCPDLHQSQSHLQPRL- 
DYS S Y A R T S IRASLT F N R G F - 

TCAAGGCTGGGAGGAACATGCGTCGCAAACTCTTTGGGGTCTTGCGGCTGAAGTGTCACA 
2941 + + + + + + 3 000 

AGTTCCGACCCTCCTTGTACGCAGCGTTTGAGAAACCCCAGAACGCCGACTTCACAGTGT 

SRLGGTCVANSLGSCG* SVT 
QGWEEHASQTLWGLAAEVSQ- 
KAGRNMRRKLFGVLRLKCHS- 

GCCTGTTTCTGGATTTGCAGGTGAACAGCCTCCAGACGGTGTGCAGCAACATCTACAAGA 

3001 + + + + + + 3060 

CGGACAAAGACCTAAACGTCCACTTGTCGGAGGTCTGCCACACGTGGTTGTAGATGTTCT 

ACFWI CR*TASRRCAPTSTR 
PVSGFAGEQP PDGVHQHLQD 
LFLDLQVNSLQTVCTNIYKI- 

TCCTCCTGCTGCAGGCGTACAGGTTTCACGCATGTGTGCTGCAGCTCCCATTTCATCAGC 

3061 + + -f + + + 3120 

AGGAGGACGACGTCCGCATGTCCAAAGTGCGTACACACGACGTCGAGGGTAAAGTAGTCG 

SSCCRRTGFTHVCCSSHFIS 
PPAAGVQVSRMCAAAPISSA- 
LLLQAYRFHACVLQL PFHQQ- 

AAGTTTGGAAGAACCCCACATTTTTCCTGCGCGTCATCTCTGACACGGCCTCCCTCTGCT 
3121 + + + + + + 3180 

TTCAAACCTTCTTGGGGTGTAAAAAGGACGCGCAGTAGAGACTGTGCCGGAGGGAGACGA 

KFGRTPHFSCASSLTRPPSA 
SLEEPHIFPARHL*HGLPLiL- 
VWKNPTFFLRVISDTASLCY- 

ACTCCATCCTGAAAGCCAAGAACGCAGGGATGTCGCTGGGGGCCAAGGGCGCCGCCGGCC 

3181 + + + + + + 3240 

TGAGGTAGGACTTTCGGTTCTTGCGTCCCTACAGCGACCCCCGGTTCCCGCGGCGGCCGG 

TPS* KPRTQGCRWGPRAPPA 
LHPESQERRDVAGGQGRRRP - 
S I LKAKNAGMSLGAKGAAGP- 

CTCTGCCCTCCGAGGCCGTGCAGTGGCTGTGCCACCAAGCATTCCTGCTCAAGCTGACTC 

3241 + + + + + + 3300 

GAGACGGGAGGCTCCGGCACGTCACCGACACGGTGGTTCGTAAGGACGAGTTCGACTGAG 

LCPPRPCSGCATKHSCSS*L 
SALRGRAVAVPPS I PAQADS - 
LPSEAVQWLCHQAFLLKLTR- 

GACACCGTGTCACCTACGTGCCACTCCTGGGGTCACTCAGGACAGCCCAGACGCAGCTGA 

3301 + + + + + + 3360 

CTGTGGCACAGTGGATGCACGGTGAGGACCCCAGTGAGTCCTGTCGGGTCTGCGTCGACT 

DTVS PTCHSWGHSGQPRRS * 
TPCHLRATPGVTQDSPDAAE - 
HRVTYVPLLGS LRTAQTQLS- 

GTCGGAAGCTCCCGGGGACGACGCTGACTGCCCTGGAGGCCGCAGCCAACCCGGCACTGC 

3361 + + + + + + 3420 

CAGCCTTCGAGGGCCCCTGCTGCGACTGACGGGACCTCCGGCGTCGGTTGGGCCGTGACG 
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a VGS SRGRR* LPWRPQPTRHC 

b SEAPGDDADCPGGRSQPGTA- 

C RKLPGTTLTALEAAANPALP- 

CCTCAGACTTCAAGACCATCCTGGACTGATGGCCACCCGCCCACAGCCAGGCCGAGAGCA 

3421 + + + + + + 3480 

GGAGTCTGAAGTTCTGGTAGGACCTGACTACCGGTGGGCGGGTGTCGGTCCGGCTCTCGT 

a PQTSRPSWTDGHPPTARPRA 

b LRLQDHPGLMATRPQPGREQ 

c SDFKTILD*WPPAHSQAESR- 

GACACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACAC 
3481 + + + + + + 3540 

CTGTGGTCGTCGGGACAGTGCGGCCCGAGATGCAGGGTCCCTCCCTCCCCGCCGGGTGTG 

a DTSS PVTPGSTSQGGRGGPH 

b TPAALSRRALRPREGGAAHT 

c HQQPCHAGLYVPGREGRPTP- 

CCAGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGT 

3541 + + + + + + 3600 

GGTCCGGGCGTGGCGACCCTCAGACTCCGGACTCACTCACAAACCGGCTCCGGACGTACA 

a PGPHRWESEA*VSVWPRPAC 

b QARTAGSLRPE *VFGRGLHV- 

c RPAPLGV*GLSECLAEACMS~ 

CCGGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTG 

3601 + + + + + + 3660 

GGCCGACTTCCGACTCACAGGCCGACTCCGGACTCGCTCACAGGTCGGTTCCCGACTCAG 

a PAEG*VSG*GLSECPAKG*V 

b RLKAECPAEA*ASVQPRAEC 

c G*RLSVRLRPERVSSQGLSV- 

TCCAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCC 

3661 + + + + + + 3720 

AGGTCGTGTGGACGGCAGAAGTGAAGGGGTGTCCGACCGCGAGCCGAGGTGGGGTCCCGG 

a SSTPAVFTSPQAGARLHPRA 

b PAHLPSSLPHRLALGSTPGP- 

c QHTCRLHFPTGWRSAP P QGQ- 

AGCTTTTCCTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGA 

3721 + + + + + + 3780 

TCGAAAAGGAGTGGTCCTCGGGCCGAAGGTGAGGGGTGTATCCTTATCAGGTAGGGGTCT 

a SFSSPGARLPLPT*E*SIPR 

b AFPHQEPGFHSPHRNS PSPD- 

c LFLTRSPASTPHIGIVHPQI- 

TTCGCCATTGTTCACCCCTCGCCCTGCCCTCCTTTGCCTTCCACCCCCACCATCCAGGTG 
3781 + + + + + + 3840 

AAGCGGTAACAAGTGGGGAGCGGGACGGGAGGAAACGGAAGGTGGGGGTGGTAGGTCCAC 

a FAIVHPSPCPPLPSTPTIQV 

b SPLFTPRPALLCLPPPPSRW- 

c RHCS PLALPS FAFHPHH PGG- 

GAGACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTG 

3841 + + + + + + 3900 

CTCTGGGACTCTTCCTGGGACCCTCGAGACCCTTAAACCTCACTGGTTTCCACACGGGAC 
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ETLRRTLGALGIWSDQRCAL 
RP*EGPWELWEFGVTKGVPC 
DPEKDPGSSGNLE*PKVCPV- 

TACACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGT 

3901 + + + + i + 3960 

ATGTGTCCGCTCCTGGGACGTGGACCTACCCCCAGGGACACCCAGTTTAACCCCCCTCCA 

YTGED PAPGWGSLWVKLGGG 
TQARTLHLDGGPCGSNWGEV- 
HRRGPCTWMGVPVGQ I GGRC- 

GCTGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGAAAAAAAAAAAAAAAAA 

3961 + + + + + + 4020 

CGACACCCTCATTTTATGACTTATATACTCAAAAAGTCAAAACTTTTTTTTTTTTTTTTT 

A V G V K Y * IYEFFSFEKKKKK 
LWE *NTEYMSFSVLiKKKKKK- 
CGSKILNI * V F Q F * KKKKKK- 

AAAAAAAAA 

4021 4029 

TTTTTTTTT 

K K K 
K K 
K K 
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i 

met 

GCAGCGCTGCGTCCTGCTGCGCACGTGGGAAGCCCTGGCCCCGGCCACCCCCGCG ATG 

10 

pro arg ala pro arg cys arg ala val arg ser leu leu arg ser 
CCG CGC GCT CCC CGC TGC CGA GCC GTG CGC TCC CTG CTG CGC AGC 

20 30 
his tyr arg glu val leu pro leu ala thr phe val arg arg leu 
CAC TAC CGC GAG GTG CTG CCG CTG GCC ACG TTC GTG CGG CGC CTG 

40 

gly pro gin gly trp arg leu val gin arg gly asp pro ala ala 
GGG CCC CAG GGC TGG CGG CTG GTG CAG CGC GGG GAC CCG GCG GCT 

50 60 
phe arg ala leu val ala gin cys leu val cys val pro trp asp 
TTC CGC GCG CTG GTG GCC CAG TGC CTG GTG TGC GTG CCC TGG GAC 

70 

ala arg pro pro pro ala ala pro ser phe arg gin val ser cys 
GCA CGG CCG CCC CCC GCC GCC CCC TCC TTC CGC CAG GTG TCC TGC 

80 90 
leu lys glu leu val ala arg val leu gin arg leu cys glu arg 
CTG AAG GAG CTG GTG GCC CGA GTG CTG CAG AGG CTG TGC GAG CGC 

100 

gly ala lys asn val leu ala phe gly phe ala leu leu asp gly 
GGC GCG AAG AAC GTG CTG GCC TTC GGC TTC GCG CTG CTG GAC GGG 

110 120 
ala arg gly gly pro pro glu ala phe thr thr ser val arg ser 
GCC CGC GGG GGC CCC CCC GAG GCC TTC ACC ACC AGC GTG CGC AGC 

130 

tyr leu pro asn thr val thr asp ala leu arg gly ser gly ala 
TAC CTG CCC AAC ACG GTG ACC GAC GCA CTG CGG GGG AGC GGG GCG 

140 150 
trp gly leu leu leu arg arg val gly asp asp val leu val his 
TGG GGG CTG CTG CTG CGC CGC GTG GGC GAC GAC GTG CTG GTT CAC 

160 

leu leu ala arg cys ala leu phe val leu val ala pro ser cys 
CTG CTG GCA CGC TGC GCG CTC TTT GTG CTG GTG GCT CCC AGC TGC 

170 180 
ala tyr gin val cys gly pro pro leu tyr gin leu gly ala ala 
GCC TAC CAG GTG TGC GGG CCG CCG CTG TAC CAG CTC GGC GCT GCC 

190 

thr gin ala arg pro pro pro his ala ser gly pro arg arg arg 
ACT CAG GCC CGG CCC CCG CCA CAC GCT AGT GGA CCC CGA AGG CGT 
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200 210 
leu gly cys glu arg ala trp asn his ser val arg glu ala gly 
CTG GGA TGC GAA CGG GCC TGG AAC CAT AGC GTC AGG GAG GCC GGG 

220 

val pro leu gly leu pro ala pro gly ala arg arg arg gly gly 
GTC CCC CTG GGC CTG CCA GCC CCG GGT GCG AGG AGG CGC GGG GGC 

230 240 
ser ala ser arg ser leu pro leu pro lys arg pro arg arg gly 
AGT GCC AGC CGA AGT CTG CCG TTG CCC AAG AGG CCC AGG CGT GGC 

250 

ala ala pro glu pro glu arg thr pro val gly gin gly ser trp 
GCT GCC CCT GAG CCG GAG CGG ACG CCC GTT GGG CAG GGG TCC TGG 

260 270 
ala his pro gly arg thr arg gly pro ser asp arg gly phe cys 
GCC CAC CCG GGC AGG ACG CGT GGA CCG AGT GAC CGT GGT TTC TGT 

280 

val val ser pro ala arg pro ala glu glu ala thr ser leu glu 
GTG GTG TCA CCT GCC AGA CCC GCC GAA GAA GCC ACC TCT TTG GAG 

290 300 
gly ala leu ser gly thr arg his ser his pro ser val gly arg 
GGT GCG CTC TCT GGC ACG CGC CAC TCC CAC CCA TCC GTG GGC CGC 

310 

gin his his ala gly pro pro ser thr ser arg pro pro arg pro 
CAG CAC CAC GCG GGC CCC CCA TCC ACA TCG CGG CCA CCA CGT CCC 

320 330 
trp asp thr pro cys pro pro val tyr ala glu thr lys his phe 
TGG GAC ACG CCT TGT CCC CCG GTG TAC GCC GAG ACC AAG CAC TTC 

340 

leu tyr ser ser gly asp lys glu gin leu arg pro ser phe leu 
CTC TAC TCC TCA GGC GAC AAG GAG CAG CTG CGG CCC TCC TTC CTA 

350 360 
leu ser ser leu arg pro ser leu thr gly ala arg arg leu val 
CTC AGC TCT CTG AGG CCC AGC CTG ACT GGC GCT CGG AGG CTC GTG 

370 

glu thr ile phe leu gly ser arg pro trp met pro gly thr pro 
GAG ACC ATC TTT CTG GGT TCC AGG CCC TGG ATG CCA GGG ACT CCC 

380 390 
arg arg leu pro arg leu pro gin arg tyr trp gin met arg pro 
CGC AGG TTG CCC CGC CTG CCC CAG CGC TAC TGG CAA ATG CGG CCC 

400 

leu phe leu glu leu leu gly asn his ala gin cys pro tyr gly 
CTG TTT CTG GAG CTG CTT GGG AAC CAC GCG CAG TGC CCC TAC GGG 

410 420 
val leu leu lys thr his cys pro leu arg ala ala val thr pro 
GTG CTC CTC AAG ACG CAC TGC CCG CTG CGA GCT GCG GTC ACC CCA 
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ala ala gly 
GCA GCC GGT 



ala pro glu 
GCC CCC GAG 



leu arg gin 
CTC CGC CAG 



ala cys leu 
GCC TGC CTG 



his asn glu 
CAC AAC GAA 



leu gly lys 
CTG GGG AAG 



met ser val 
ATG AGC GTG 



gly cys val 
GGC TGT GTT 



ala lys phe 
GCC AAG TTC 



leu arg ser 
CTC AGG TCT 



arg leu phe 
AGG CTC TTT 



ile gly ile 
ATT GGA ATC 

ser glu ala 
TCG GAA GCA 



leu thr ser 
CTG ACG TCC 



pro ile val 
CCG ATT GTG 



val cys ala 
GTC TGT GCC 

440 

glu glu asp 
GAG GAG GAC 



his ser ser 
CAC AGC AGC 

470 

arg arg leu 
CGC CGG CTG 



arg arg phe 
CGC CGC TTC 

500 

his ala lys 
CAT GCC AAG 



arg asp cys 
CGG GAC TGC 

530 

pro ala ala 
CCG GCC GCA 



leu his trp 
CTG CAC TGG 

560 

phe phe tyr 
TTC TTT TAT 



phe tyr arg 
TTC TAC CGG 

590 

arg gin his 
AGA CAG CAC 

glu val arg 
GAG GTC AGG 

620 

arg leu arg 
AGA CTC CGC 



asn met asp 
AAC ATG GAC 



430 
arg glu lys 
CGG GAG AAG 



thr asp pro 
AC A GAC CCC 

460 

pro trp gin 
CCC TGG CAG 



val pro pro 
GTG CCC CCA 

490 

leu arg asn 
CTC AGG AAC 



leu ser leu 
CTC TCG CTG 

520 

ala trp leu 
GCT TGG CTG 



glu his arg 
GAG CAC CGT 

550 

leu met ser 
CTG ATG AGT 



val thr glu 
GTC ACG GAG 

580 

pro ser val 
CCG AGT GTC 



leu lys arg 
TTG AAG AGG 
610 

gin his arg 
CAG CAT CGG 



phe ile pro 
TTC ATC CCC 

640 

tyr val val 
TAC GTC GTG 



pro gin gly 
CCC CAG GGC 



arg arg leu 
CGT CGC CTG 



val tyr gly 
GTG TAC GGC 



gly leu trp 
GGC CTC TGG 



thr lys lys 
ACC AAG AAG 



gin glu leu 
CAG GAG CTG 



arg arg ser 
CGC AGG AGC 



leu arg glu 
CTG CGT GAG 



val tyr val 
GTG TAC GTC 



thr thr phe 
ACC ACG TTT 



trp ser lys 
TGG AGC AAG 



val gin leu 
GTG CAG CTG 

glu ala arg 
GAA GCC AGG 



lys pro asp 
AAG CCT GAC 



gly ala arg 
GGA GCC AGA 



ser val ala 
TCT GTG GCG 

450 
val gin leu 
GTG CAG CTG 



phe val arg 
TTC GTG CGG 

480 
gly ser arg 
GGC TCC AGG 



phe ile ser 
TTC ATC TCC 

510 

thr trp lys 
ACG TGG AAG 



pro gly val 
CCA GGG GTT 

540 

glu ile leu 
GAG ATC CTG 



val glu leu 
GTC GAG CTG 

570 

gin lys asn 
CAA AAG AAC 



leu gin ser 
TTG CAA AGC 

600 

arg glu leu 
CGG GAG CTG 

pro ala leu 
CCC GCC CTG 

630 
gly leu arg 
GGG CTG CGG 



thr phe arg 
ACG TTC CGC 
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650 660 
arg glu lys arg ala glu arg leu thr ser arg val lys ala leu 
AGA GAA AAG AGG GCC GAG CGT CTC ACC TCG AGG GTG AAG GCA CTG 

670 

phe ser val leu asn tyr glu arg ala arg arg pro gly leu leu 
TTC AGC GTG CTC AAC TAC GAG CGG GCG CGG CGC CCC GGC CTC CTG 

680 690 
gly ala ser val leu gly leu asp asp ile his arg ala trp arg 
GGC GCC TCT GTG CTG GGC CTG GAC GAT ATC CAC AGG GCC TGG CGC 

700 

thr phe val leu arg val arg ala gin asp pro pro pro glu leu 
ACC TTC GTG CTG CGT GTG CGG GCC CAG GAC CCG CCG CCT GAG CTG 

710 720 
tyr phe val lys val asp val thr gly ala tyr asp thr ile pro 
TAC TTT GTC AAG GTG GAT GTG ACG GGC GCG TAC GAC ACC ATC CCC 

730 

gin asp arg leu thr glu val ile ala ser ile ile lys pro gin 
CAG GAC AGG CTC ACG GAG GTC ATC GCC AGC ATC ATC AAA CCC CAG 

740 750 
asn thr tyr cys val arg arg tyr ala val val gin lys ala ala 
AAC ACG TAC TGC GTG CGT CGG TAT GCC GTG GTC CAG AAG GCC GCC 

760 

his gly his val arg lys ala phe lys ser his val ser thr leu 
CAT GGG CAC GTC CGC AAG GCC TTC AAG AGC CAC GTC TCT ACC TTG 

770 780 
thr asp leu gin pro tyr met arg gin phe val ala his leu gin 
ACA GAC CTC CAG CCG TAC ATG CGA CAG TTC GTG GCT CAC CTG CAG 

790 

glu thr ser pro leu arg asp ala val val ile glu gin ser ser 
GAG ACC AGC CCG CTG AGG GAT GCC GTC GTC ATC GAG CAG AGC TCC 

800 810 
ser leu asn glu ala ser ser gly leu phe asp val phe leu arg 
TCC CTG AAT GAG GCC AGC AGT GGC CTC TTC GAC GTC TTC CTA CGC 

820 

phe met cys his his ala val arg ile arg gly lys ser tyr val 
TTC ATG TGC CAC CAC GCC GTG CGC ATC AGG GGC AAG TCC TAC GTC 

830 840 
gin cys gin gly ile pro gin gly ser ile leu ser thr leu leu 
CAG TGC CAG GGG ATC CCG CAG GGC TCC ATC CTC TCC ACG CTG CTC 

850 

cys ser leu cys tyr gly asp met glu asn lys leu phe ala gly 
TGC AGC CTG TGC TAC GGC GAC ATG GAG AAC AAG CTG TTT GCG GGG 

860 870 
ile arg arg asp gly leu leu leu arg leu val asp asp phe leu 
ATT CGG CGG GAC GGG CTG CTC CTG CGT TTG GTG GAT GAT TTC TTG 
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880 

leu val thr pro his leu thr his ala lys thr phe leu arg thr 
TTG GTG ACA CCT CAC CTC ACC CAC GCG AAA ACC TTC CTC AGG ACC 

890 900 
leu val arg gly val pro glu tyr gly cys val val asn leu arg 
CTG GTC CGA GGT GTC CCT GAG TAT GGC TGC GTG GTG AAC TTG CGG 

910 

lys thr val val asn phe pro val glu asp glu ala leu gly gly 
AAG ACA GTG GTG AAC TTC CCT GTA GAA GAC GAG GCC CTG GGT GGC 

920 930 
thr ala phe val gin met pro ala his gly leu phe pro trp cys 
ACG GCT TTT GTT CAG ATG CCG GCC CAC GGC CTA TTC CCC TGG TGC 

940 

gly leu leu leu asp thr arg thr leu glu val gin ser asp tyr 
GGC CTG CTG CTG GAT ACC CGG ACC CTG GAG GTG CAG AGC GAC TAC 

950 960 
ser ser tyr ala arg thr ser ile arg ala ser val thr phe asn 
TCC AGC TAT GCC CGG ACC TCC ATC AGA GCC AGT GTC ACC TTC AAC 

970 

ar 9 gly P^e lys ala gly arg asn met arg arg lys leu phe gly 
CGC GGC TTC AAG GCT GGG AGG AAC ATG CGT CGC AAA CTC TTT GGG 

980 990 
val leu arg leu lys cys his ser leu phe leu asp leu gin val 
GTC TTG CGG CTG AAG TGT CAC AGC CTG TTT CTG GAT TTG CAG GTG 

1000 

asn ser leu gin thr val cys thr asn ile tyr lys ile leu leu 
AAC AGC CTC CAG ACG GTG TGC ACC AAC ATC TAC AAG ATC CTC CTG 

1010 1020 
leu gin ala tyr arg phe his ala cys val leu gin leu pro phe 
CTG CAG GCG TAC AGG TTT CAC GCA TGT GTG CTG CAG CTC CCA TTT 

1030 

his gin gin val trp lys asn pro thr phe phe leu arg val ile 
CAT CAG CAA GTT TGG AAG AAC CCC ACA TTT TTC CTG CGC GTC ATC 

1040 1050 
ser asp thr ala ser leu cys tyr ser ile leu lys ala lys asn 
TCT GAC ACG GCC TCC CTC TGC TAC TCC ATC CTG AAA GCC AAG AAC 

1060 

ala gly met ser leu gly ala lys gly ala ala gly pro leu pro 
GCA GGG ATG TCG CTG GGG GCC AAG GGC GCC GCC GGC CCT CTG CCC 

1070 1080 
ser glu ala val gin trp leu cys his gin ala phe leu leu lys 
TCC GAG GCC GTG CAG TGG CTG TGC CAC CAA GCA TTC CTG CTC AAG 

1090 

leu thr arg his arg val thr tyr val pro leu leu gly ser leu 
CTG ACT CGA CAC CGT GTC ACC TAC GTG CCA CTC CTG GGG TCA CTC 
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1100 1110 
arg thr ala gin thr gin leu ser arg lys leu pro gly thr thr 
AGG ACA GCC CAG ACG CAG CTG AGT CGG AAG CTC CCG GGG ACG ACG 

1120 

leu thr ala leu glu ala ala ala asn pro ala leu pro ser asp 
CTG ACT GCC CTG GAG GCC GCA GCC AAC CCG GCA CTG CCC TCA GAC 

1130 1132 
phe lys thr ile leu asp OP 

TTC AAG ACC ATC CTG GAC TGA TGGCCACCCGCCCACAGCCAGGCCGAGAGCAGA 

CACCAGCAGCCCTGTCACGCCGGGCTCTACGTCCCAGGGAGGGAGGGGCGGCCCACACCC 

AGGCCCGCACCGCTGGGAGTCTGAGGCCTGAGTGAGTGTTTGGCCGAGGCCTGCATGTCC 

GGCTGAAGGCTGAGTGTCCGGCTGAGGCCTGAGCGAGTGTCCAGCCAAGGGCTGAGTGTC 

CAGCACACCTGCCGTCTTCACTTCCCCACAGGCTGGCGCTCGGCTCCACCCCAGGGCCAG 

CTTTTCYTCACCAGGAGCCCGGCTTCCACTCCCCACATAGGAATAGTCCATCCCCAGATT 

CGCCATTGTTCACCCYTCGCCCTGCCYTCCTTTGCCTTCCACCCCCACCATCCAGGTGGA 

GACCCTGAGAAGGACCCTGGGAGCTCTGGGAATTTGGAGTGACCAAAGGTGTGCCCTGTA 

CACAGGCGAGGACCCTGCACCTGGATGGGGGTCCCTGTGGGTCAAATTGGGGGGAGGTGC 

TGTGGGAGTAAAATACTGAATATATGAGTTTTTCAGTTTTGRAAAAAAAAAAAAAAAAAA 

AAAAAAAAAA 
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